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Preface 


The transition from studying calculus or differential equations to learning about 
proofs is one that is enormously interesting as it shows how exciting mathematics 
can be. The most important aspect of this stage of the transition for students is the 
need for rigorous mathematical reasoning. The benefits to readers who are moving 
from calculus to more abstract mathematics are to develop the ability to understand 
proofs through Problems and Proofs in Numbers and Algebra (PPNA) on their ways 
to analysis, abstract algebra, etc. which come next. Our goal is for students to focus 
on how to both prove theorems and solve problem sets which have depth—multiple 
steps are needed to prove or solve. 

Our approach, “solving rigorous problems and learning how to prove,” gives a 
platform of two specific content themes, number theory and algebra (polynomials), 
with some aspects of applications. Undergraduate mathematics students will then 
learn how to prove and solve problems because they are comfortable with the two 
content themes. Furthermore, our approach is that the content areas of this book 
allow students to develop a natural and conceptual understanding of the mathematics 
on their path forward. Students will study the concept with clarity, precision, and 
a mathematical habit of the mind through these problems. They will gain the 
foundations of mathematical proof techniques and styles. 

The key to the text is its interesting and intriguing problems, exercises, theorems, 
and proofs, showing how students will transition from the usual, more routine calcu- 
lus to abstraction while also learning how to “prove” or “solve”. Its applications such 
as RSA cryptosystems, Universal Product Code (UPC), and International Standard 
Book Number (ISBN) are included in sections of the third chapter. 

Problems and proofs are the heart of mathematics. The goal of conceptual 
understanding grows as a large number of problems and examples that reward 
curiosity and insightfulness over simplicity. The problems are multi-step and require 
the reader to think. An intriguing variety of problems range from moderate to 
thoughtful to deep. 

Each problem set begins with a few easy problems. After that, and in coordina- 
tion with our approach to the subject of depth and conceptual understanding, many 
of the other problems require multi-step solutions, whether they be problems or 
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proofs. In addition, there are exercises in the text; this difference between exercises, 
examples, and problem sets is that the exercises stay close to the examples of 
the section allowing students the immediate opportunity to practice developing 
techniques. 

Furthermore, some problems are motivated from various mathematics com- 
petitions. Dr. Shiue’s significant experience with problems includes constructing 
problems and proofs while he is involved in the American Regional Mathematics 
League (1997-2009), American Mathematical Competitions/MAA (since 1998), 
Taiwan Regional Mathematics League (since 1997), and Taiwan Junior High School 
Mathematics Competitions (since 2002). His work with competitions has enriched 
the quality of our problems. 

Some of the concepts include the following: 


Number Theory, Algebra, Proofs with approaches to them, Division Algorithm, Euclidean 
Algorithm, Greatest Common Divisor, Least Common Multiple, the Remainder Theorem, 
Diophantine Equations and Counting, Equivalence Classes, Divisibility of both Integers 
and Polynomials, Factoring Polynomials and Roots, Matrices (in the plane and in 3-space), 
Cramer’s Rule, and Determinants, among others. 


This text has been revised by the authors over 7 years. An earlier course has 
been used twice at the University of Kentucky (Problem Solving for Middle School 
Teachers) and at the University of Nevada, Las Vegas (four times as a secondary 
resource for future teachers). High school teachers can use the PPNA material in 
their classroom for strong and advanced students through numbers and algebra. 

An advanced math course, Problems and Proof, at the public high school in 
Georgia, Gwinnett School of Math, Science and Technology (GSMST), has been 
a basis for PPNA. PPNA has been revised each spring semester from 2011 to 
2014 by the authors and four Georgia Institute of Technology graduate students. 
Justin Boone has done very well as the individual who not only has helped 
with the typesetting via 41x but also has given us fine advice. We very much 
appreciate the advice of Daniel Connelly, S. Greyson Daugherty, and Nolan Leung 
for their excellence in teaching the PPNA course at GSMST and the help of Scott 
MacDonald, graduate student at UNLV. 

We thank the high school and college students and teachers who worked through 
the various revisions of the draft, who were a pleasure to collaborate with us, and 
who are to continue to make mathematics an even more interesting place. We would 
be happy to receive comments about the book and to respond. Please send to richard. 
millman @ math.gatech.edu, shiue @unlv.nevada.edu, or ekahn @bloomu.edu. 

Dr. Millman’s granddaughter, Bluma Millman, enjoys a wonderful mathematics 
major with algebra and number theory and Sandy has helped much. Dr. Shiue would 
like to thank his wife, Stella, for her full support during the preparation of this book, 
and it is with great pleasure that Dr. Kahn would like to express his full gratitude 
to his wife Emily for her support and encouragement throughout all phases of this 
project. 
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In addition, Dr. Tian-Xiao He, Illinois Wesleyan University, and Dr. William 
Speer, UNLV, have reviewed PPNA much and given us good advice. We thank 
Dr. Derrick DuBose, Chairman, Department of Mathematical Sciences, UNLV, for 
his support. 


Atlanta, GA Richard S. Millman 
Las Vegas, NV Peter J. Shiue 
Bloomsburg, PA Eric Brendan Kahn 
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Part I 
The Integers 


Chapter 1 
Number Concepts, Prime Numbers, 
and the Division Algorithm 


Introduction The topic of this section is the divisibility of integers, the basic 
building blocks (called prime numbers) for integers, and how to apply this foun- 
dation to problem solving in combinatorics and word problems in which integers 
solutions are sought (Diophantine equations). 


Brief Description The operations of addition and multiplication are two ways to 
combine integers to get a third integer (this is called the closure property). We’ ll 
now talk a bit about trying to do the operations in reverse (subtraction and division) 
and discuss the formal definitions which will give rise to the big question of this 
half of the text and the big theorem, Fundamental Theorem of Arithmetic, and its 
applications. 

Addition and multiplication satisfy rules that we expect such as associativity, 
commutativity and the existence of an identity for the operation. More precisely, if 
a,b and c are any integers then 


(a+ b)+c=a+(b+c) and a(bc) = (ab)c (associativity) 


a+b=b+aandab = ba (commutativity) 
Furthermore, there is an integer 0 such that 
a+0=0+a=a (0 is the additive identity) 
and there is an integer 1 such that 


a-l=l-a=a (1 is the multiplicative identity) 


© Springer International Publishing Switzerland 2015 3 
R.S. Millman et al., Problems and Proofs in Numbers and Algebra, 
DOI 10.1007/978-3-319-14427-6_1 


4 1 Number Concepts, Prime Numbers, and the Division Algorithm 


In addition, when we mix these two operations, they respect each other. We have 
two distribution properties: 


(a+b)c =ac+be. 


a(b +c) =ab+ac 


II 


There is one important difference between these operations and that is whether or 
not one can “go backwards” from the identity—the existence of an INVERSE. For 
addition, “go backwards” means that given any integer a, there is another integer x 
(written as —a) such that a + x = 0. In more formal language, there is an additive 
inverse for each integer and that number is actually an integer. We express this fact 
by saying that the set of integers is CLOSED UNDER TAKING ADDITIVE INVERSES. 

For multiplication, “go backwards” would mean that, if a is a given integer, is 
there an integer x such that a times x equals the multiplicative identity, 1 Unlike 
addition, “going backwards” rarely works for multiplication. For example, there 
are no solutions to the equation ax = 1 whena = 0. That is, a = 0 has no 
multiplicative inverse. Furthermore, when there is a solution to ax = 1, that solution 
is not an integer unless a is 1 or —1. 

There is a difference here. While the set of integers is closed under addition 
and taking additive inverses (subtraction), the set of integers is closed under 
multiplication but is not closed under taking multiplicative inverses (division). This 
last fact can be seen easily through another example. The solution to 


3x = 1 


isx = ; but ; is not an integer. Other that +1, multiplicative inverses don’t exist in 
the set of integers. On the other hand, we may still be able to answer the question 
of divisibility which is the subject of Sect. 1.1. 


Question If a and Dd are integers, is there an integer x such that 
ax = b? 


The above question is the motivation for the whole of Chap. |. We will begin by 
treating the topic of divisibility by asking what is the relationship between integers 
a and b which ensures you can solve the equation of one variable x, ax = b, in 
such a way that the solution x, is actually an integer? 

Going further, when does an integer equation in two variables have a solution 
which involves only integers and what are those solutions? Said mathematically, if 
a1, a2 and b are integers with a; # 0 and a ¥ O, are there integers x and y such 
that 


aix+a@y=b? 


Such equations are called linear Diophantine equations and are related to the 
subject of counting as we’ll discuss in Chap. 2. We will then define what is meant 
for elements of a set to be the same, equivalence classes, and apply this approach 
to both clock arithmetic and a rigorous definition of fractions. 
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1.1 Beginning Number Concepts and Prime Numbers 


Overview We start with Z, the collection of all integers, and N, the collection of 
all positive integers. The introduction to Chap. | discusses the operations of integers 
and the building blocks of prime numbers. These symbols will be used throughout 
this text and are common throughout mathematics. 


Definition. If a and b are integers and a 4 0, then a DIVIDES b (or a is a DIVISOR 
of b) if there is an integer c such that 


ac=b., (1.1.1) 


In this case, we also say that (i) a is a DIVISOR of Jb, (ii) a is a FACTOR of b, or 
(iii) b is a MULTIPLE of a. 


Notation. We will write a|b to mean that a divides b or b is DIVISIBLE by a. The 
symbol a + b means that “a does not divide b.” Thus 3]9 but 4 + 9. 


Remark. When a + 0, there is always a real number, c, which is a solution 
to (1.1.1); it is just that the solution is not necessarily an integer. The point of 
divisibility is that the solution c to (1.1.1) is actually an integer. 


Remark. The symbol a|b has nothing (or little) to do with the fraction B To say 
that 3|6 means that 6 is divisible by 3; that is, there is an integer c with 3c = 6 (of 
course, c = 2 solves the equation). On the other hand, 3 + 8 since there is no integer 
c with 3c = 8. There is a fraction (which is not an integer), x = g, with 3x = 8. 
Since § = 24 is not an integer, 3 does not divide 8, 3 + 8. 


The definition of divisibility quickly gives the following two properties stated 
as Exercises 1 and 2 which are used later in the text. The first exercise shows that, 
if an integer divides each of two numbers, then it divides both their sum and their 
difference. 


Exercise 1. If a|b and alc then prove that a|(b + c) and a|(b —c). 


Exercise 2. Let a,b,c € Z. Then prove that 


A. Ifa divides b and b divides c, then alc. 
B. If a divides b and a divides c, then, for any integers k and /, a divides the 
integer kb + Ic. 


Just as substances are made up of molecules and molecules are made up of 
atoms and atoms are made up of electrons, neutrons and protons, so there exist 
basic building blocks of integers which are called prime numbers. We will continue 
to carefully define terms and present the the Fundamental Theorem of Arithmetic 
(Theorem |) which makes the previous sentence mathematically precise. We assume 
that you have already seen prime numbers and so shall not spend much time on 
motivating examples. 


Definition. If n is an integer and n > 1 then is a PRIME NUMBER if the only 
positive divisors of n are | and n. 
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We say 1 is a COMPOSITE NUMBER if n > 1 and not a prime number. 


Suppose we start with a given integer greater than 1, say n = 315 and start to 
look at its factors until we can’t go any further. This is the mathematical idea of 
writing an integer as the product of basic building blocks (prime numbers). 


Remark. How many divisors does a prime number, n, have? The answer is not two, 
it’s four and they are +1 and +n. The definition of a prime number talks about 
positive divisors so there are no contradictions here (this point is often overlooked 
by students). 


There are 5 ways to factor 315 into 2 numbers (ignoring order). Here are 4 of 
them (9 x 35 is the fifth). 


45 7 15 21 5 63 3 105 


Each of these can be further factored to give rise to the trees (they are actually 
called “factor trees”) below. 


Un 
a F 
3 5 3 7 


3 


/\ 
a 


ot 


315 
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45 7 
~ 
9 5 
3 
315 315 
63 
/\ 
21 3 
7% 
3 7 


1.1 Beginning Number Concepts and Prime Numbers 7 


The important principle to be noticed here is that no matter which way we 
continued to factor (and there are more than we have written), the final result always 
has two 3’s, one 5, and one 7, and each of the integers 3, 5, and 7 are prime 
numbers. The Fundamental Theorem of Arithmetic is a formal statement of this 
fact. Its proof requires more advanced material (see References) and so is omitted. 
It is also referred to as “UNIQUE PRIME FACTORIZATION” because 315 = 32-5-7. 


Theorem 1 (Fundamental Theorem of Arithmetic). Each integer n > 1 can be 
decomposed as a product of prime numbers (some of which may appear more than 
once). Furthermore, this decomposition is unique except for the order of the factors. 


The statement of uniqueness means that the prime factors are always the same 
(except for the order we list them) and the number of times (MULTIPLICITY) a prime 
appears in the factorization is always the same. The prime factors of 315 are 3, 5, 
and 7 and 3 appears twice (we say that “3 has multiplicity 2”). 

By collecting like primes and replacing them by a single factor, we can rephrase 
Theorem | as a corollary: 


Corollary 1. Any positive integer n > 1 can be written in exactly one way in the 
form 
oki ok ky 
n= p\' py’... P, 
where fori = 1,2,...,r each k; is a positive integer, called a multiplicity, each p; 


is a prime, and py < pr <... < p,. We call this representation the PRIME POWER 
DECOMPOSITION OF n. 


At this point, we will assume knowledge of the language used to describe a set but 
not any formal set theory properties. We also assume the reader is familiar with set 
notation and the idea that a set is a collection of objects. We will briefly review the 
notations of union, intersection, and empty and disjoint sets. Note that the concept 
of a set was first considered by Georg Cantor (1845-1918). 


Notation. We say x is an ELEMENT of the set A if x is a member of A and we write 
“x € A”, 


To list a set, we write out all of the elements of the set A inside a set of braces 
{, } or give a rule which defines the set. In the notation below, we pronounce the 
symbol “|” as “such that”. For example, let A be the set of integers between | and 
10 inclusive. Then we can write A in two distinct ways: 


A = {1,2,3,4,5,6,7,8,9, 10} 
or 


A={xeEN|1 <x < 10}. 
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While the symbol “|” when used in a description of a set means “such that’, 
remember that “|” when used with two integers means “divides”. Thus 7|21 (which 
is true) is a statement and has nothing to do with constructing a set. We can construct 
new sets from two others (say A and B) in a number of ways. 

The most useful constructions are the intersection of A and B (which consists of 
all elements which are common to A and B) and union of A and B (which consists 
of those elements which are in A or B). 


Definition. The EMPTY SET, written @, is the set with no elements in it. 
Definition. If A and B are sets then the INTERSECTION OF A AND B, written 
AN B, is given by 

AN B= {x|xeAandx e B}. 


If AN B = @ then A and B are DISJOINT. 


It is, for example, easy to see that if A is the set of integers which are multiples 
of 3 and B is the set of integers which are multiples of 2, then AM B is the set of 
integers which are multiples of 6. 


Definition. If A and B are sets, the UNION OF A AND B, written AU B, is given by 
AUB={x|xeAorxe B} 


Remark. The term “or” in the definition is inclusive. In other words, x € AUB 
means either x is in A or x is in B (or x is in both A and B.) 


Definition. If n € Z and n is divisible by 2, then n is called an EVEN INTEGER (or 
just “even’”’). If n € Z is not even, then n is ODD. 


Of course, 1 = 0 is even and so we can represent the sets of even and odd integers 
respectively as: 


Even Integers = {n € Z|n = 2k for some k € Z}, 


and 


Odd Integers = {m € Z|m = 21 + 1 for some/ € Z}. 


Example I. Prove that the sum of any two even integers is even. 


Solution. Let n and m be even. We must show that (n + m) is twice some integer. 
Since both are even, by the definition of “even”, there are integers £, and £ with 


n = 2£,; andm = 2) 


1.1. Beginning Number Concepts and Prime Numbers 9 


Thus n + m = 2€; + 2l, = 2(€; + £2) and son + m is twice the integer £; + 0 
which is the definition of n + m being even. // 


Exercise 3. Prove that the sum of any two odd integers is even. 


One useful observation is that breaking proofs up into a couple of different cases 
can really help simplify the work. Suppose that we have a statement in equation 
form about x and y with at least one of them, say x, taking only integer values. One 
way to break up the proof is to split the possibilities for x into two cases, one for even 
numbers and the other for odd. For example, since no two consecutive integers can 
both be odd, we see that the product of two consecutive integers must be divisible by 
two. Can you write a formal proof of this statement (Problem 3 of Problem Set 1.1) 


Example 2. Find all integers n such that n? — 2n — 3 is prime. 


Solution. The method here is to factor the given polynomial and since the result is 
a prime number, one of the two factors must be +1. We recognize that we can factor 
the expression and write 


n? —2n—3 = (n—3)(n+ 1). 


Since n is an integer then both n — 3 and n + 1 are also integers. Thus, the key is if 
we want n? — 2n — 3 to be a prime number then either n —3 = +l orn +1 = +1. 
Thus, 7 can be 4, 2,0, or —2. We must still check to see if these values give prime 
numbers: 
47-2*4-3=5 
2?*-2*2-3=-3 
0° —-2*0-3 =-3. 
(—2)? —2* (-2)-3 =5 

Since prime numbers must greater than 1, the only solutions aren = 4 
andn = —2. // 
Example 3. Find all x € Z such that x* + 4 is a prime. 


Solution. Trying to factor x* + 4 isn’t at all obvious! There are two steps needed 
to perform the factorization. One is to express x* + 4 as the difference of two 
squares and then factor. The idea is to do some algebra first by adding (4x? — 4x7) 
(which is really zero, of course) to the given expression and then factoring using the 
“difference of squares” formula, which says that for w and z 


w—2=(w—z2(w+ 2). 
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In our case, 
xt44= x44 4x74 4-477? = Ge? 4-2) = Ox)". 
The difference of squares formula then gives 
xt +4 = [(x? + 2) — 2x][(x? + 2) + 25] 
so that 
xe 4 = (? = 2x + 2)? + Ox +2). 


The point is that since we want to know for what x the left hand side is prime, we 
can look for the values of x where exactly one of the terms on the right hand side is 
either +1 or —1. So we are left to find solutions of the four quadratic equations: 


x? —2x+2=1,x7-2x4+2=-1 
x 4+2x4+2=1,x74+2x4+2=-1. 


The second and fourth of these equations yield no integer solutions by the quadratic 
formula. However the first and third equations can be rewritten as: 


(x — 1)* = Oand (x + 1)? = 0. 


So we have two possible x values that will make x* +4 prime, namely 1 and —1. We 
see if we plug either answer in for x, then the calculation yields 5 which is prime. 
The only x that makes x* + 4 prime is when x = +1. // 


Let’s summarize the technique for solving a problem of the kind “find the values 
of x such that the polynomial p(x) is a prime number”. We are motivated by the 
definition of a prime number and so we first do some algebra to get the expression 
P(x) to be the product of two others, then we set each of those two expressions to 
1 and then —1. We thus have a list of all possible solutions to the problem and we 
need to check to see which x values actually give a prime number as an answer. This 
type of problem can be deceptively complex as sometimes very simple expressions 
have difficult answers. For example, are there infinitely many primes of the form 
a? + 1 with a € Z? Number theorists believe that there are infinitely many. (See [4] 
for more information.) 


Exercise 4. Find all x € Z so that 
x = be +95 


is a prime number. (Hint: Show that this expression is the same as 
(x? + 5)? — 16x”. Can you factor the latter in a manner similar to Example 3?) 
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We shall close this section by proving two more facts about primes by using the 
concepts of this section. 


Theorem 2 (Euclid’s Theorem). There are infinitely many primes. 


Proof. Assume that there are only finitely many primes pj, p2,..., Pn. Let m = 
P1P2°** Pn + 1. We know from the Fundamental Theorem of Arithmetic that there 
is a prime p such that p|m. Since p is a prime and py, po,..., Pn are all of the 
primes, there must be a j such that p = p; (j = 1,2,---,n). Now p|pip2:*: Pn 
and p|m. By Exercise 1, we have p|1. This is a contradiction. a 


Proposition 1. [fn is a composite number, then n has a prime divisor p such that 
psn. 
2 


Proof. By hypothesis, n = ab where 2 < a < b < n. Therefore,n > a* or 
a < /n. Let p be a prime such that pla. Such a p exists by the Fundamental 
Theorem of Arithmetic. Thus, p|n and p <a < ./n. i 


For example, 40 is a composite, 2|40, 2 < /40 and 5/40, 5 < \/40. Following 
Proposition 1, we can conclude that to test whether the number 7 is prime, it is not 
necessary to test whether n has a prime divisor greater than ./n. 

For example, is n = 311 a prime? The primes less than /311 < 18 are 
2,3,5, 7,11, 13,17. Since none of these primes are a divisor of 311, we conclude 
that 311 is prime. 


Problem Set 1.1: 


1. (a) List five of the elements of the set 
O={me€Z|m=2l—1 forsome ¢ € Z}. 


(b) How would you describe the set O in words? 
(c) Look at the definition of the set of odd numbers in this text. Is there a 
contradiction between that definition and the defining properties of O? 
. Prove that the sum of an odd integer and an even integer is odd. 
. Prove that the product of two consecutive integers is even. 
. Prove that the only even integer that is prime is the number 2. 
. Find positive integers, p and g, both prime, such that p + g = 313. 
. If is an integer, prove the sum of its square and the square of the integer two 
more than 77 is even. 
7. (a) Suppose that x,y ©€ Z and both are even or both are odd. Prove the 
difference of their squares (x* — y7) is divisible by 4. 
(b) If x, y € Z, x is odd, and y is even, is the same result as 7a true? 
8. Prove that the sum of the squares of any two consecutive integers is odd. 
9. If 3 + n, then prove that (n? — 1) is divisible by 3. 
10. If a and b are odd numbers, then prove that a> — b? is even. 
11. Find all x € Z such that x? — 4x — 5 is prime. 
12. Find all x € Z such that x? — 2x — 15 is prime. 


NnfWN 
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13. Show that there are no odd integers x greater than 1 for which x? + 1 is a prime 
number. 

14. Find all x € Z such that x*+ + x? + 1 is a prime if there are any. 

15. Find all x € Z (or show that none exist) such that x* + 3x? + 4 isa prime. 

16. Find all x € Z (or show that none exist) such that x* — 8x? + l6isa prime. 

17. Find the smallest positive integer 1 such that 


p(n) = n+nt+17 


is prime. 

18. A number of the form 2” — 1 for n € N — {1} is called a Mersenne number 
(named after the French monk, Father M. Mersenne (1588—1648) who did much 
to stimulate mathematical research). It is easy to check that 27 — 1 = 3 and 
23 — 1 = 7 which are primes. However 2!! — 1 = 2047 = 23 x 89 which is 
not prime. A prime number of the form 2” — | is called a Mersenne prime. It is 
unknown whether or not there are infinitely many Mersenne primes, the largest 
known to date is 2°78! __ 1] (See Wikipedia.com as of 2013). Show that 2” —1 
is a composite number if 7 is a composite number. 

19. Let f(x) = x? + x + 41. Show that f(n) is prime for 0 < n < 39, but f(40) 
is a composite number. 

20. By using Proposition 1, determine which of the following integers are prime. 


(a) 403 
(b) 157 
(c) 1999 


1.2 Divisibility of Some Combinations of Integers 


Overview The first two paragraphs of this section consist of two seemingly 
different kinds of results. The first type is to state general results (such as “If an 
integer divides two others then it divides their sum’’). The second sort is to ask the 
reader to find all integers which satisfy certain divisibility criteria (“If n is a non- 
negative integer, find all values for m such that (3n — 8) divides (2n + 3), which is 
Example 8). 

In fact these two types of problems are similar as they both start with using a 
variable name (usually 7) and asking for what n is a condition true. For the first kind 
of problem, the answer is “all n” (or perhaps “all odd n”, “all even n”, etc.). For the 
second, the solution will be a few (or more) values of n for which the criterion is 
true. Perhaps more importantly, the first kind of result (such as “The product of two 
consecutive numbers is even’.) can be used to solve both types of problems. 

The next set of examples and exercises can be approached by giving variable 
names to the integers involved and doing a little bit of algebra. It is useful to 
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remember that if we are dealing with an odd integer n, then n may be written as 
2k + 1 for some k € Z. In Example 1, we used a similar technique for a result about 
the sum of even integers. 


Example 4. Show that the sum of 3 consecutive integers is always divisible by three. 
Solution. Let n € Z. The three consecutive integers are then n,n + 1, andn + 2 
and so the sum of three consecutive integers, called s is 


s=nt(n4+1)+(4+2) =3n4+3=3(n4+1). 


Thus s is 3 times an integer and so 3|s. // 


Note One can also let the three consecutive integers be n — 1,n, andn + 1. 
The generalization of Example 4 is interesting. See Problem | of the Problem 
Set 1.2. 


Example 5. If a and b are odd, then prove that a” — b? is divisible by 8. 
Solution. Since a and D are both odd, there are integers k and / witha = 2k + 1 
and b = 2/ + 1. Thus 
cab =Ok+ 17 =O! +17 
= 4k? + 4k — 41? — 41 
= 4(k? +k —1? —D). 


Thus a? — b? is divisible by 4 but that is not enough to also say it is divisible by 8. 
Factoring the above equation gives: 


a’ —b* = 4(k(k +1) -—4(00 + 1). (1.2.1) 


Now using the insight that k(k + 1) and /(/ + 1) are consecutive integers and so they 
must both be even (divisible by 2), we can write k(k + 1) = 2r and/(/ + 1) = 2s 
with r,s € Z. Substituting into Eq. (1.2.1), 


a’ — b? = 8r — 85 = B(r —s) 


and so a? — b? is divisible by 8. i 


We believe strongly that multiple steps in a problem is most important. This 
example requires multiple steps: understanding the definition of odd, doing some 
algebra, seeing that we are not yet finished (divisible by 4 at this stage), more 
algebra, starring the equation to see that we are multiplying two consecutive integers 
(and so have an even number), and then finishing the problem by combining terms. 
This is the typical method of problem solving—multiple layers with a number of 
insights, separated by some routine (or clever) computations! A problem where we 
just write out an answer doesn’t give insights but multiple levels do. 
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Example 6. Let n € N and suppose that 7 is odd. Prove that 
32 | (n* + 11)(n? + 15). 


Solution. Since n is odd, it is of the form n = 2/ + 1 for some / € Z. The integer 
on the right-hand of the assertion becomes 


(n? + 11)(n? + 15) = (21 + 1) + 11)((22 + 1° + 15) 
= (47 + 41 + 12)(41? + 41 + 16) 
= 160 +1437 +144). 
The second two numbers on the right-hand side are consecutive integers. By 


Problem 3 of Problem Set 1.1, their product must be even, say written as 2k, where 
k is an integer. Thus, since we have k defined by the equation 


(414390 +144 = 2k, 
we see that 
(n? + 11)(n? + 15) = 16(2k) = 32k 


and so 32 | (n* + 11)(n? + 15). / 


Exercise 5. Let € N and suppose that 7 is odd. Prove that 
32|(n* + 4n — 1)(n? + 4n +4 3). 
Exercise 6. Let n € N and suppose n is odd. Prove that 
32|(n? + 3)(n? +7). 


We now use Exercises | and 2 to solve some divisibility problems. 


Example 7. \f a,b and c are integers such that a|(b + c) and a|(b — c), then a|2b 
and a|2c. 


Solution. Exercise | shows that a must divide both the sum of (b + c) and (b —c) 
and their difference. Therefore a|2b and a|2c. // 


Example 8 is the first of the problems that uses Exercises | and 2 (if a divides 
two integers, then it divides both their sum and their difference). It is an extremely 
useful technique when we remember that any integer divides itself! Thus if 3n — 8 
is an integer then (32 — 8)|(3n —8) and so (3n —8) divides any multiple of (31 — 8), 
so in particular, (37 — 8)|(—2)(3n — 8). This very simple (but clever) observation 
lets us solve Example 8 because we end up with (3n — 8) dividing an integer (25) 
which does not contain 7 as a factor. 
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2: 3 
Example 8. Find all values of n such that both n and we are positive integers. 
n — 


Solution. Since any integer divides itself, (3m — 8)|(3n — 8) and the example says 
that (3 — 8)|(2n + 3) by assumption, we use Exercise 2B (with k = 3 and/ = —2) 
to see that (3n — 8) divides 3(2n + 3) — 2(3n — 8) = 25. If 3n — 8 divides 25, then 
3n — 8 must equal +1, +5, or +25. Since wits is a positive integer, there are only 
3 choices available: 


3n —8 = +1,4+5, or + 25 


so 3n must be 9, 13, or 33 orn = 3, B or 11. 
Because n is a positive integer by assumption, the only possibilities left aren = 3 


or 11 and they both work. // 


Here is an alternative solution of Example 8 which doesn’t use Exercise 2B. Let’s 
define m as 


2n+3 _ 


=meN. 
3n —8 


Then 27 + 3 = m(3n — 8) or 3 = 3mn — 2n — 8m. 
To aid in the factoring process, we multiply the last equation by 3 and adding 16 
to both sides gives 


25 = 9mn — 6n — 24m + 16. 


The right-hand side of the previous equation factors into (3n — 8)(3m — 2). Thus 
(3n — 8)(3m — 2) must have the same divisors as 25. The only choices for solutions 
of integer values are summarized in the table which follows: 


3n —8|| 1 | —1 |5|—5|25|—25 
3m — 2|[25|—25|5|—5] 1 | -1 


(This means, for example, that 3n — 8 = 1, and 3m — 2 = 25 is one possibility to 
check out). Examining each possibility gives: 


3n || 9 | 7 |13| 3 |33|—17 
3m ||27|—23] 7 |—3] 3 | 1 


Since both n and m are assumed to be positive integers, there are only two solutions 
remaining 


(n = 3 andm = 9) or (n = 11 andm = 1). 


In other words, n = 3 or 11 are only solutions (as before). 
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Note: Example 8 can also be solved in the following simple way. Since a + eN 
then gets =2+ oy é€ N. Then a € N, hence 3n —8 = 1, 5, or 25 which gives 


n = 3,11,or 2. Thus n = 3 or 11 are the only solutions in N. 
Exercise 7. Let n be a non-negative integer. Find all n such that (2n — 1)|(6” +5). 


The purpose of looking at this example with two approaches is to show the 
ways in which a result like Exercise 2B can simplify proofs. While the second 
method that was used to solve Example 8 is elementary, it required some “tricks” to 
complete. Having proven Exercise 2B gives a structure for looking at this example 
and a platform for proving other results. Problem solving can be “tricks” or can 
use mathematical structure—both are correct. It becomes a choice as to which you 
prefer. 

One should admire the cleverness of the proof or solution of some results. We 
believe, however, that a major aspect of the appeal and beauty of mathematics is in 
its coherence and the way it builds on its foundation. Of course, what each of us 
sees as the beauty of mathematics is very personal. 


Problem Set 1.2: 


1. (a) Let n € N be odd. Prove that the sum of any n consecutive numbers is 
divisible by 1 (Hint: Let x be the first number and write out the sum). 
(b) Is the result of Part (a) true for any even integer of n > 3? 
. Find all integers x such that (2x — 1)|(9x — 4). 
. Find all x € Z such that (3x + 1)|(5x — 2). 
. Assume that n is an integer greater than |. For what values 7 is an integer? 
. Find all positive integers a and b such that ab = a — 5b + 20. (Hint: What is 
(a + 5)(b — 1)?) 
6. (a) Find all positive integers a and b such that ab — a — 3b = 4. 
(b) Find all positive integers a and b such that a(b + 2) = b. 
7. (a) Letm € Z. If m|(21n + 20) and m|(7n + 3) for some n € Z, what are the 
possible value(s) of m? 
(b) Let b € Z and assume b|(b + 8), (b — 1)|(b + 11) and (b — 4)|(3b + 6). 
Find the value of b. 
8. Using Exercise 2B, find all positive integers n such that 
integer. 
9. Using the method of the alternate solution from Example 8 and the simple way 
described in the note following Example 8, find all positive integers n such that 
ual is a positive integer. 
10. Using the method of the alternate solution from Example 8 and the simple way 
described in the note following Example 8, find all positive integers n such that 
out) is a positive integer. 
11. Use all methods to find all possible positive integers n such that 
positive integer. 


2n+1 


nA Wh 


4n—1 
2n+3 


is a positive 


2n+8 is a 
n—7 


1.3. Long Division: The Division Algorithm 17 


12. Use all methods (similar to those above) to find all positive integers n such that 


2n 


——— iS a positive integer. 
3n—5 


13. Let a, b, c, d, e and k be any integers. If e divides (ka — b) and e divides 
(kc — d), prove that e divides (ad — bc). 


1.3. Long Division: The Division Algorithm 


Overview Our next topic is “long division”. It is an idea which you will meet again 
and again during mathematics and possibly a teaching career in many different 
ways. The procedural understanding of how to actually do long division is, of 
course, very important. Conceptual understanding of this process is also crucial. 

Many parts of problem solving in number theory are based on an in-depth 
comprehension of long division. We will work with both an example and a statement 
using the variables a and b with the quotient being g and the remainder r in order 
to show conceptually what long division is. For example, we know that 37 divided 
by 7 has a quotient g = 5 and remainder r = 2. 

If a and b are integers, with a > b and b positive, doing what is usually called 
“long division” is a way to check if b|a. The foundation for long division is to start 
with the integer b, subtract it from a, and then repeat the process. To illustrate the 
procedure, we will use b = 7 and a = 37 as an example. We keep subtracting the 
number b from a and then take b from what remains, again and again. How many 
times? The answer is g times (g € N) until either a = gb (which isn’t the case for 
a = 37andb =7)ora =qbh+r where0 <r <b (q = 5 andr = 2 in the case 
of b = 7 anda = 37). This is formalized by the Division Algorithm, which is so 
important that we don’t give it a number, just a name, and mark it with a star (*). 


The Division Algorithm. Leta € Z, b €N, then there exist integers q andr such 
that 


a=qb+rand0<r <b. (*) 


Furthermore, the integers q andr which solve Eq. (*) are uniquely determined. 


Definition. The integer g in Division Algorithm (+) is called the QUOTIENT of a by 
b and r is called the REMAINDER of a on division by b. Note that b|a exactly when 
the remainder of a by b is zero. We say a is the DIVIDEND and b is the DIVISOR. 


From the sketch of how the Division Algorithm works in the case of b = 7 and 
a = 37 which precedes its statement, an argument can be made (if a > 0) that 
it should be called the “successive subtraction” algorithm, instead of the Division 
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Algorithm. The first three problems of Problem Set 1.3 show the “successive 
subtraction” approach for a > 0. 

Suppose we have two integers n; and nz. We will now look into what the 
remainder of their sum and the remainder of their product on division by b are 
as that will be useful in what follows. Simple algebra shows that if gi, q2,71, and r2 
are integers and (using the conventions of the Division Algorithm) both: 


n= qb+r, withO<r, <b 


and 
Nz = qab +r. withO <r. <b, 
then 
ny tn = (91+ q)b+ (in +12) (1.3.1) 
and 
nny = qigob” + (qir2 + gari)b + rrr. (1.3.2) 


The first term of the right side of (1.3.1) is divisible by b and the first two terms 
of the right side of (1.3.2) are divisible by b. Therefore we have the useful result 
contained in the next proposition. 


Proposition 2. [fn and nz are positive integers whose remainders on division by 
the positive integer b are r\ and rz respectively, then 


(n, +2) has the same remainder on division by b as (r; + rz) has (1.3.3) 


and 


N1N2 has the same remainder on division by b as rir2 has. (1.3.4) 


Proof. The Division Algorithm shows that there are unique integers q1, g2, 71, and 
r2 such that 


ni =qbt+r; withhO <r; <b fori = 1,2. (1.3.5) 
We first prove Eq. (1.3.3) by adding the equations of (1.3.5) to obtain 


ny +n = (git qab+ (1 +17). (1.3.6) 
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However, we can apply the Division Algorithm to the integer (7; + 7). Writing the 
quotient or 7, + rz on division by b as q* and the 


remainder of (r, + r2) on division by b= r* (1.3.7) 
and 
rtrm=q*bt+r* withO <r* <b. 
Substituting into Eq. (1.3.6) yields 
(atm) =(dtatg)bt+r*. (1.3.8) 
Equation (1.3.8) writes the integer (7; +2) as the sum of an integer times 5 plus an 
integer r* withO < r* < b. The quotient and remainder of any integer, in particular, 
(n, + 72) on division by b must be unique so Eq. (1.3.8) shows that the quotient of 
(ny + 2) is (q1 + q2 + q*) and the remainder of (mn, + 12) on division by b is r*, 


which by definition is Eq. (1.3.7), the remainder of (7; + rz) on division by b. 
To prove Eq. (1.3.4) we follow the same procedure. 


nny = qigob? + (qirt + gar2)b + rrr. (1.3.9) 
However, if r =remainder of r,r2 on division by b and gq is its quotient, then 
ryrg = qb+rwhereO <r <b. 
Substituting into Eq. (1.3.9) yields 
nn = (qigbt+qr+g@nitqbtrwithdO<r<b. 


Thus, by uniqueness, r must be the remainder of n,n2 on division by b. Of course, 
g1g2b + giro + qari +¢ is the quotient of n\n by b, but that result is not useful. Hl 
Exercise 8. 


A. Explain why in Eq. (1.3.3) we did not claim that the remainder of n; + nz is 
always equal to 7; + rz. (Give an example.) Hint: Look at the statement of the 
Division Algorithm. 

B. Explain why in Eq. (1.3.4) we did not claim that the remainder of (7172) is 
always r1r2. (Give an example.) 


Example 9. Find the remainder of 
1 = 1841 + 1954 


on division by 6. 
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Solution. Since 1841 = (306)6 + 5 and 1954 = (325)6 + 4, the remainder of 
1 = 3,795 on division by 6 therefore is the same as the remainder of (5 + 4) = 9 by 
6. The answer is 3; that is, the remainder of 3795 on division by 6 is 3. This problem 
gives an example of why the remainder of (7; + 2) is not ry + ro. // 


Example 10. Find the remainder of n = (1841)(1954) on division by 5. 


Solution. Since 1841 = 1840 + 1, the remainder of 1841 on division by 5 is 1. The 
remainder of 1954 = 1950 + 4 is 4, so Proposition 2 says that the remainder of their 
product n, is the product of (4)(1) = 4 (since 4 < 5). // 


Example 11. Find (a) the remainder of n = (40)° on division by 19; (b) the 
remainder of n = (40)!° on division by 19. 


Solution. 


(a) The remainder of 40 on division by 19 is 2. Repeated application of Propo- 
sition 2 shows that remainder of (40)° on division by 19 is the same as the 
remainder of 2° = 32 on division by 19 and so the answer is 13. 

(b) Since the remainder of 40 on division by 19 is 2, the remainder of n = (40)!° 
on division by 19 is the same as the remainder of 2! = (2°)? by 19. Because 
we know the remainder of 2° by 19 is 13 from part (a), we need only find the 
remainder of (13)? = 169 by 19. Since 169 = (19)(8) + 17, the answer is 17. 
This problem gives an example to show why the remainder of (1172) need not 
be rfo. 


// 


In the Division Algorithm, we have assumed that the divisor b is positive but here 
we made no assumption about the dividend a. What then is the interpretation of the 
Division Algorithm if a < 0? Let’s start with an example. 


Example 12. What is the quotient and remainder of —5 on division by 2? 
Solution. The precise conclusion of the Division Algorithm means that we must 
find g € Zandr € N U {0} with 

—-S5=q2+r (1.3.10) 


where 0 < r < 2. In the interpretation of the case in which a and b are both positive, 
we used subtraction of b from a. However, subtraction of 2 from —5 gives —7. We 
are going in the wrong direction to get the quotient and remainder of —5 on division 
by 2. Instead, addition will help: 
—5+b = —3 but —3 is not between 0 and 2 (1.3.11) 
—3 +b = —1 but —1 is not between 0 and 2 (1.3.12) 


—1+b=1=r which satisfies 0 < r < 2. (1.3.13) 
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How many additions were there? There were 3 so we are led tog = —3 andr = 1. 
Putting these values for g and r into Eq. (1.3.10) gives 


—5 = (-3)2+1 


which checks. Thus the remainder is 1 and the quotient is —3. Note that there is no 
assumption that the quotient must be positive in the Division Algorithm. // 


oo 


Problem Set 1.3: 


. (a) What are the quotient and remainder of 63 divided by 17? 


(b) Follow the discussion at the beginning of this section and first subtract 17 
from 63, then subtract 17 from your answer, etc. until you end up with 
an integer which is at least 0 but less than 17. Show that the number of 
subtractions is the quotient of 63 by 17 and the remainder is the number 
between 0 and 17. 


. (a) What are the quotient and remainder of 35 by 7? 


(b) Repeat part b of Problem | above. 


. (a) With the help of Example 12, what are the quotient and remainder of —27 


by 6? 
(b) Repeat part b of Problem | above. 


. (a) What are the quotient and remainder of —7 divided by 3? 


(b) What happens if you try to interpret the quotient of —7 by 3 using 
successive subtractions? 

(c) What happens if you try to interpret the quotient of —7 by 3 using 
successive additions? 


. (a) What are the quotient and remainder of —5 divided by 3? 


(b) What happens if you try to interpret the quotient of —5 by 3 using 
successive subtractions? 

(c) What happens if you try to interpret the quotient of —5 by 3 using 
successive additions? 


. (a) What are the quotient and remainder of —32 divided by 6? 


(b) What happens if you try to interpret the quotient of —32 by using successive 
subtractions? 

(c) What happens if you try to interpret the quotient of —32 by 6 using 
successive additions? 


. Without using a calculator, 


(a) find the remainder of (12345)(678) on division by 13. 
(b) find the remainder of (30)° on division by 13. 


. Find the remainder of n = (12345)(678) + 30° on division by 13. 
. Without using a calculator, 


(a) find the remainder of (15743)(365) on division by 7. 
(b) find the remainder of (50)!° on division by 49. 
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Find the remainder of n = (15743)(365) + 50! on division by 7. 

Find the remainder of n = (1841)(1954) + 40!° on division by 11. 

Suppose that y is the sum of squares of three consecutive integers. Prove that 

the remainder of y on division by three is 2. 

Suppose that y is the sum of squares of any four consecutive integers. Prove 

that the remainder of y on division by four is 2. 

(a) Given the results of Problems 12 and 13, do you think it is reasonable to 
make the following conclusion? 


Conclusion Let y be the sum of squares of consecutive integers (n > 2). 
Then the remainder of y on division by n is 2. 

(The answer to this question is yes or no! That is, yes, it sounds reasonable 
or no, it doesn’t sound reasonable.) 

Is the above conjecture true? If so then prove it, otherwise find a counterex- 
ample. (Hint: Try n = 5.) 

Fact!: There is a formula due to Jakob Bernoulli (1654-1705) that gives the 
sum of squares. If n > 2 is an integer then 


(b 


wm 


n(n — 1)(2n — 1) 


427? 4+---4(n-1P = - 


Suppose y is the sum of the squares of n consecutive integers n > 2 and x is 
the first integer. 


(a) Show that 


n(n — 1)(2n — 1) 


y=nx? +n(n—1)x+ F 


Hint: Use Bernoulli’s formula of Problem 14 for the sum of squares above. 
(b) Show that the remainder of y on division by n is the remainder of r = 
ee on division by n (for example, 1 = 4 givesr = 14, so the 
remainder of y on division by n is 2). 
(c) Can you conclude from the formula (a) that y is divisible by 1? Why or 
why not? 


1.4 Tests for Divisibility in Base Ten 


Overview Here’s a general class of problems to solve: how can we determine 
whether a given (small) integer can divide the integer n? This is the “problem of 
divisibility”. This subject is another example in which problem solving depends on 


'This fact can be proven easily if the reader is familiar with mathematical induction. 
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establishing an approach (a foundation) from which we can directly solve problems. 
This section makes some definitions and then solves the problem of divisibility by 
all integers up to 13. 


Divisibility Problem Overview How can we determine when an integer k divides 
another integer n? What properties of guarantee that k divides n? 

The answer appears by looking at combinations of digits in our base ten 
representation of n. This section uses the base ten representation of integers to come 
up with criteria for divisibility by many small integers. 

For what k can we come up with a test for whether k divides n? The criteria 
of the tests for divisibility will involve the sum and difference of the digits that 
form the base ten representation. In the rest of this section, we will present tests for: 
k = 2,3,4,5,7,8, 9,11, 13 and all powers of 2 and 5. The list is more complete 
than it looks because for k = 1, there is nothing needed since for any n, k is a 
divisor! Furthermore, 6 = 2-3 and so we can use the test fork = 2 andk = 3 
to investigate divisibility by 6. Since an integer is divisible by ten exactly when its 
last digit is zero, we will have test for divisibility by all integers from 2 to 13 as 
well as some others. See the problems at the end of this section. We will see in 
other examples (for instance, Example 17) that to prove divisibility by a composite, 
it suffices to check divisibility by its prime divisors. 


Definition. The collection of integers ao, a),...,a; each of which is between zero 
and nine is called the BASE TEN REPRESENTATION of 7 if Eq. (1.4.1) is satisfied. 
The integer n is written informally as n = agdg_1-+++d2a\do and ag,...,do are 
called DIGITS. In particular, dg is called the LAST DIGIT of n. More formally, 


n = aj10! +.a)_-110'"! + --- + a210? + a, 10! + ao and a # 0. (1.4.1) 


The tests below are named by the integer that we hope will divide the given 
number. Except for Test 2, each test is illustrated by an example before its proof is 
given. At that time, we restate each test to fix the notation of the proof. 


Test 2 A positive integer n is divisible by 2 if and only if the last digit of n is even. 
In general, n € N is divisible by 2* if and only if the last k digits of n are divisible 
by Qk Equivalently, let the base 10 representation of n be n = ajaj-; ...do. Then 
2* \n if and only if 2* |a,_ ...ao. 


Exercise 9. Use Test 2 to decide if 1336 is divisible by 2. Is it divisible by 4? Is it 
divisible by 8? 


Test 5 A positive integer is divisible by 5 if and only if the last digit is divisible by 
5 (i.e. the last digit is either 0 or 5). In general, n € N is divisible by 5* if and only 
if the last k digits of n are divisible by 5*. 


Example 13. Show that 375 is divisible by 5, 25, and 125. 


Solution. Since 5 = 5!, 25 = 5”, and 125 = 53, we need to determine, thanks to 
Test 5, whether 5|5, 57|75 and 5°|375, all of which are true. Thus 375 is divisible by 
5, 25 and 125. // 
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Restatement of Test 5 If the base 10 representation of 1 is n = agag_)---a2a\d 
then 5“ |n if and only if 5“|ax—jax—2 +++ do. 


Proof. We will leave the case when k = 1 and k = 3 as Exercise 10 and 
work through the case when k = 2. The general case can easily be proven using 
mathematical induction if the reader has studied that concept. When k = 2, the 
theorem (assuming £ > 2) says that 25|n if and only if 25 divides the integer 
represented by a;do which is 10a; + ao. From (1.4.1) 


n = (ac 10° + ap, 10! +--+ + a210°) + (10a, + ao). 


But every term in the first parenthesis has a factor of 100. Thus 25|n if and only 
if 25|(10a, + ag). Since 10a; + ao has as its base 10 representation a,ao, we are 
finished. H 


Exercise 10. If the base 10 representation of n ism = agag_; ---d24,do, prove that 


A. 5|n if and only if 5|ao. (Test 5 fork = 1) 
B. 125|n if and only if 125 divides the integer whose base ten representation if 
a7a\do. (Test 5 for k = 3) 


The next three tests involve looking at the sums (or differences) of the digits that 
represent the given number, 7, in base 10. Before starting these tests, we will need 
some terminology. There is a slight technicality depending on whether there is an 
even number or an odd number of digits in the base ten representation of n. 


Definition. If € N is represented as a base 10 number as in Eq. (1.4.1) via 
nN = ajqaj—- 1 +*+:a2a\ao9 


then the DIGIT SUM OF 7 is aq] + aj_) +++: +42 +a; + ao. If / is even, then 
the EVEN DIGIT SUM is dg + G2 +44 +--+: + ae_-2 + a 
and 
the ODD DIGIT SUM Is ad; + a3--- + ade_3 + Ge_1. 


If / is odd, the even digit sum and odd digit sum are defined analogously (but 
end with a;_; and qa; respectively). The even digit sum always includes do and the 
other even-indexed digits, whereas the odd digit sum includes a; and the other odd- 
indexed digits. 


Example 14. Ifn = 3574 then there are 4 digits in the base 10 representation of n 
so that £ = 3. The digit sum is 19, the odd digit sum is 10, and the even digit sum is 
9. If m = 3,556, 143 then / = 6 and the even digit sum is 3 + 5+ 1-+3 = 12 and 
the odd digit sum is 5+ 6+ 4 = 15. 
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Test 3/Test 9 (Test for divisibility by 3 or 9): A positive integer n is divisible by 3 
if and only if its digit sum is divisible by 3. The integer n is divisible by 9 if and 
only if its digit sum is divisible by 9. 


Example 14 (continued) Since the digit sum of n = 3574 is 19 and 19 is not 
divisible by 3, 3 does not divide 3574. Since the digit sum form = 3,556, 143 is 
27, m is divisible by 3 and 9. 


Restatement of Test 3/9 Let n be represented as n = agae_| ---d2d\d9. Then we 
have the following two results: 

A. 3 divides n if and only if 3 divides (ag + ae_) +---+ a2 +a); + a) 

B. 9 divides n if and only if 9 divides (ag + ag_) +--+ + a2 +a, + a0). 

Proof. Part B will be left as an exercise (Problem 6 of Problem Set 1.4) since its 
proof is very similar to Part A. We’ll now show that the remainder of n on division 
by 3 is the same as the remainder of the digit sum of n on division by 3. Note that 
the remainder of 10 on division by 3 is 1. Hence, by Proposition 2, the remainder 
of 107, 10°, etc. is also 1. Since n is given by Eq. (1.4.1) invoking both Eggs. (1.3.3) 
and (1.3.4) shows that 


remainder of 7 on division by 3 = (remainder of 10°) (remainder of ae)+ 
(remainder of 10°~!) (remainder of ay_)) +--+ 


+(remainder of 107)(remainder of a) 


+ (remainder of 10) (remainder of a,) + remainder of do. 
(1.4.2) 


But the first term of each product of Eq. (1.4.2) is one from the first paragraph of the 
proof so the remainder of 7 on division by 3 is the remainder of ag + ae_; +--+ 
on division by 3. a 


Included in the proof is the stronger result which we give as Corollary 2. 
Corollary 2. 


A. The remainder of n on division by 3 is the same as the remainder of the digit 
sum of n on division by 3. 

B. The remainder of n on division by 9 is the same as the remainder of the digit 
sum of n on division by 9. 


Using Corollary 2B is also called the method of “casting out nines”. Many grade 
school children are fascinated by this test. 
Exercise 11. Find all values of x in each of the following. 


A. 3|435x17. 
B. 9|435x7. 


Test 11 A positive integer n is divisible by 11 if and only if the difference between 
its even digit sum and odd digit sum is divisible by 11. 
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Example 15. Letn = 6853. Then the even digit sum is 3 + 8 = 11, the odd digit 
sum is 5 + 6 = 11 and the difference is 11 — 11 = 0. Because 11 divides 0, 11 
divides 6853. 


Remembering that zero is divisible by 11 (since 11-0 = 0), do the following 
exercise. Of course, zero is divisible by all integers except 0. 


Exercise 12. Which of the following integers are divisible by 11? 


A. 2,343 
B. 9,190,718,290 
C. n = 34c67c41 


We’ ll restate Test 11 in the case where £ is even. When £ is odd, the proof is the 
same—it is just the notation that is slightly different. 


Restatement of Test 11 If the base 10 representation of n isn = agag_) ---a2a\ a 
then 11|n if and only if 


11|((ae + deo + +++ + dp + a0) — (Qe-1 + ae-3 + +++a3 +41). 


Proof. We'll use the simple observation that 10 = 11 — 1 with the obvious fact that 
we can rewrite Eq. (1.4.1) as 


n =aj(11—1)) + a)_-1(11— 1)! +... + a)(11 — 1) + a. (1.4.3) 
Note that any power of (11 — 1) has 11 in each term except the last one which is 


+1 (expand the term (11 — 1)* using algebra without simplifying). More precisely, 
there are integers by € Z with 


d1— 1 = 11(some integer) + 1 = 11h, +1 


where +1 is used if k is even and —1 is used if k is odd. When / is even, for example, 
Eq. (1.4.3) becomes 


n = a,(11b; — 1)! +. aj, (1b) — 1) 14+... 4a, (1b) —1) +49 (1.4.4) 
or 


n = 11(aj;b) + ay-ybj-) +... + a, d1) 
(1.4.5) 
+(a; —aj-) +... + a2 —a, +40). 


Thus 11|n if and only if 11 divides the alternating sum of the base 10 representation 
of n which is Test 11. The case in which / is odd is exactly the same. ai 


Because it is used in practice, we isolate the definition implicit in this proof. 
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Definition. The ALTERNATING DIGIT SUM OF n is the difference of the even digit 
sum and the odd digit sum. 


Definition. Suppose that n = a 10! + ay_,10!“! +. --- + ao so that n is written as 
aj aj—1 +++ Qo in base ten representation. Consider the ASSOCIATED SEQUENTIAL 
TRIPLE SUM for n, s? (n), defined as the finite sum: 


s™ (n) = (ap + 10a, + 10°a2) — (a3 + 10aq + 10°a5) 
+ (ag + 10a7 + 107ag) — (ay + 1019 + 107411) + °° 
Let’s look at Example 16A more closely to see a pattern which makes it much 
easier to follow the definition of the sequential triple sum. In that case when n = 
176, 182, 461, writing vertically, 
s’(n) =461 
—182 
+176 
455 
In other words, starting with the integer 1, we look at the 3 digits between each of 


the commas and form their alternating sum. This observation will be useful to you 
as you work homework problems. 


Test 7/Test 13 17 is divisible by: 


(a) 7 if and only if s7 (n) is divisible by 7. 
(b) 13 if and only if s7 () is divisible by 13. 


Example 16. 


A. Determine whether n = 176,182,461 is divisible by 7, without the use of a 
calculator. 
B. For what values of a is 


n= 14, 4a6, 3a2, 121 


divisible by 13? 
Solution. 


A. The associated sequential triple sum for 7 is 
s™(n) = (1 + 6(10) + 4(100)) 


— (2 + 8(10) + 1(100)) + (6 + 7(10) + 1(100)) 
= 461 — (182) + (176) = 455. 
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Since 7/455, the original is divisible by 7. (The power of this method is that 
we’ve reduced the question of divisibility of a 9 digit number to that of a 3 digit 
number.) 

B. Once again, forming the associated sequential triple sum gives 


II 


s’(n) = 121 — (3a2) + (4a6) — 14 
107 — (300 + 10a + 2) + (400 + 10a + 6) 


= 211. 


Since 211 is not divisible by 13, there are no values of a for which the given n 
of Part B is divisible by 13. 


// 


Since the notion can become cumbersome (to say the least!) we’ll now prove 
Test 7/13 only for the case of a 6 digit number (so £ = 5). What is astounding is that 
both of these tests can be proven simultaneously because of the clever observation 
that 


1001 = 7(11)(13). 


Restatement of Test 7/13 Suppose that 7 is a six digit number whose base ten 
representation is 1 = a5a4a342Q, do and write 


s™(n) = (ao + 10a; + 1072) — (a3 + 10a4 + 107a5). (1.4.6) 


Show that: 


(a) n is divisible by 7 if and only if s7 (1) is divisible by 7 
(b) n is divisible by 13 if and only if s7 (1) is divisible by 13. 


Proof. We will use the observation that 
10° = (1001 — 1) = (7-11- 13)—1. (1.4.7) 
We’ ll start with n = a5a4a3d2d1do so that 
n =(do + 10a; + 107a2) + (1023 + 1044 + 10°as) 
= (ay + 10a, + 107a2) + 107(a3 + 10a4 + 107a5) 
= (dy + 10a; + 107a2) + (1001 — 1)(a3 + 10a4 + 10745) 


= (do + 10a; + 10°a2) + (7- 11+ 13)(a3 + 10a4 + 10745) 
— (a3 + 10a4 + 107a5) 
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where the last equality follows from Eq. (1.4.7). Thus, from the definition of the 
sequential triple sum and Eq. (1.4.7), we get 


n= s'(n) + (7-13-11)(a3 + 10aq + 107a5). 


The second term of the right hand side is always divisible by both 7 and 13 (and 11) 
so n is divisible by 7 (or 13) if and only if s(n) is divisible by 7 (or 13). In fact, 
we’re showing that the remainder of n on division by 7 (or 13) is the same as the 
remainder of s/ (1) on division by 7 (or 13). A truly remarkable proof! a 


It’s important to note two points. First of all, the associated sequential triple 
sum really reduces the question of divisibility of the given number by 7 (no matter 
how many digits it has) to the divisibility of a much smaller (frequently three digit) 
number by 7—a much more manageable question. 

Secondly, without the theory developed in the above tests, either part of 
Example 16 would be very hard. It is the foundation which we have developed 
(Test 7/13) which allows for easy problem solving. Once again, an established 
theory is a tremendous help in solving problems and can reduce the problem to 
an easy exercise. 


Exercise 13. Determine whether the integer of Exercise 12B is divisible by 13. 


To do a problem involving divisibility by a composite number, one may check 
divisibility by divisors that are prime numbers (and their powers if needed). That is 
what the Fundamental Theorem of Arithmetic (Theorem 1) enables us to do. Recall 
that we used this major theorem in the case when we divided by 6 = 2-3. 


Example 17. Let n be represented in base 10 as 12a3b so that n = 104 + 2(10°) + 
a(10*) + 3(10) + b. Find all digits a and b such that 15|n and find n. 


Solution. Note that 15|n if and only if 3|n and 5|n. We will use Tests 3 and 5 on 
the integer n = 12a3b whose digit sum is s(n) = 6 + a + b. If 5|12a3b then Test 
5 says there are only two possibilities, b = 0 or b = 5. 

If b = 0, then s(n) = 6+ a. Thus, when b = 0, 3|n if and only if 3|(6 + a) 
which simplifies to 3|a. Thus a = 0,3,6,9. Therefore, the solutions for 15|n when 
b = Oare 


n = 12030; 12330; 12630; or 12930. 


If b = 5, a similar argument shows that a = 1, 4,7. Put differently, the solutions 
for n (with b = 5) are: n = 12135; 12435; or 12735. 

All the possible values for a and b and the corresponding number n are 
summarized in the table below: 


a 0 3 6 9 1 4 7 
b 0 0 0 0 ) 5 5 
12, 030] 12, 330) 12, 630)12, 930) 12, 135)12, 435}12, 735 


= 


// 
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Exercise 14. If 3 divides n = 34a56b and 4 divides n find the digits a and b. If 
there is more that one solution, find them all. If there are none, explain why. 


Example 18. If 44|1985c63d find all possible values for the digits c and d. 


Solution. Now 44 = 4.- 11 and so we’ll start with 4 (the “easier factor”). Since 
2? = 4 divides the number, its last two digits must be 32 or 36 so either d = 2 or 
d=6. 

We now use Test 11 and n = 1985c63d. We see 11|n exactly when 11 divides 
(1+8+¢+3)-(9+5+6+44)) or I1|(c—d —8). If d = 2 then 11|(c — 10) 
which is impossible if c is to be a digit (since c < 9). 


The case in which d = 6 yields a solution. When c = 3 (and for no other 
digits) 11|(¢ — 14) so the only solution is c = 3 and d = 6 (which means that 
n = 19,853, 636. // 


A word of caution about the tests given in this section. They depend critically 
on the representation of the integer n being base-10. For example, the number 4 is 
divisible by 2 and so its base-3 representation must also be divisible by 2. If you have 
already seen base-3 arithmetic, because, 4 = 1-3 + 1, so the base-3 representation 
of 4 is 11. The last digit of the base-3 representation of 4 is not divisible by 2. 
This example does not contradict Test 2 because Test 2 only deals with base-ten 
representations. We now know that 


11 (base-3) = 2 (base-3) times 2 (base-3) 
or 
113 = 23 times 23. 


and so 2 does divide 11 (base-3). Whether a number (2 in this case) divides another 
(4 in this case) is independent of what the base of that number is. The test may 
change depending on the base whereas whether a number is divisible by another 
does not change. 

In the remainder of this chapter, we will discuss the divisibility tests when the 
number given in a base other than 10. 


Problem Set 1.4: 


1. Ifn is written as 12a4c in base 10, find all digits a and c such that 15 divides n. 

2. If 33|1a245, find all possible digits a and b. 

3. If 44|1985ab38, find all possible digits a and b. 

4. Ifn is the 6 digit number m = a2b241 and 117|n, find all possible values for 7. 
(Hint: 117 = 13-9.) 

5. Prove Test 10: the integer 7 is divisible by 10 if and only if its last digit is zero. 

6. Prove Test 9. 

7. If 45|41a455, find all possible digits a and b. 


12. 


13. 


14. 
15. 
16. 


17. 
18. 
19. 


20. 
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. If 36|12a49), find all possible digits a and b. 
. If 48 divides n = 18a5b6 and 11 divides n, find all possible digits a and b. If 


there are none, explain why. 


. If 98 divides m = 62clld and 3 divides m, find all possible digits c and d. If 


there are none, explain why. 


. If 63 divides n = 2266ab and 4 divides n, find all possible digits a and b. If 


there are none, explain why. 

If 12a3b6 is divisible by 21 and b > a, find all possible digits a and b and 
show there are no more. 

Let n be given and m be the integer whose base 10 representation reverses the 
digits of n. Prove or disprove each of the following. 


(a) If a|n then a|m. 
(b) If 11|n then 11|m. 
(c) If 2>|n then 2>|m. 
(d) If 7|” then 7|m 


If n € N, show that 6 divides 7” — 1. Can you generalize this result? 

If p and q are primes and p  q, write a test for divisibility by pq. 

Find all “palindromic” 5-digit numbers which start with 2 and are divisible by 
18. (A number is a palindrome if it reads the same backwards as forwards; that 
is, if its first and last digits are the same, second and next to last digits are the 
same, and so on. For instance, 464 and 627726 are palindromes.) 

Prove that any palindrome with an even number of digits is divisible by 11. 
Let abcd be a 4-digit positive integer. Prove that 11|(abcd + bcda). 

Without using a calculator, determine which of the following numbers is a 
multiple of 9.(a) 247023846 (b) 645 x 7329 (c) 986° + 814° 

Find all choices for the digits a and c so that n, which is a five digit number 
represented by a56c2, is divisible by 9. 


1.5 Binary and Other Number Systems 


Overview This section is not necessary nor dependent on other sections, but it is 
quite interesting. In this section, we look at arithmetic in a base b which is not the 
usual b = 10 basis. Examining numbers in a base other than ten gives depth to the 
understanding of the basic concepts of arithmetic and expands our minds. We will 
finish this section by looking at what happens to the divisibility tests in base ten of 
Sect. 1.4 when a base not equal to ten is used. 


The most common and simplest representation of integers is the DECIMAL (or 


BASE TEN) system, using the ten digit symbols 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9. In 
this system, the digits of an integer represent multiples of powers of 10 (including 
10° = 1). For example: 
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2005 = 2-107 + 0-10? + 0-10' +. 5- 10° (10° = 1) 
678 = 6-10°+7-10' +8- 10° 
and so on. 


In general, any integer can be uniquely expressed in the form a,10” + 
Gn—10"—! + ++» + a210* + a,10! + ag for some n, where each digit a; satisfies 


0 <a; < 9fori = 0,1,...,n. This can be expressed as d,dy—| ...a2a,d. In 
short, 
n 
Any —1 «424,49 = Y ax - 10k (1.5.1) 
k=0 


10 is said to be the base of this system and we may write (a;,dyj—1---d1|adq)\0 to 
emphasize the base. 

Another important system is the BINARY system. This system is significant 
because it is used in computers. Every nonnegative integer has a binary representa- 
tion similar to Eq. (1.5.1) with powers of two replacing powers of 10. 


Dnbm—1...bxb\bo = So be 2 by =Oo0rl. (1.5.2) 
k=0 
The digits b;, are the binary bits, and are thus called BITS. 
We shall write the left side of expression (1.5.2) also as 
(bmbin-1 tee byb1bo)2 


if there is any ambiguity. We now define a number system for any base. 


Definition. If 7 is a positive integer the BASE-r REPRESENTATION of the whole 
number ¢ is given by the m + | integers bo, ..., bm such that 


m 
— > byr*, and 
k=0 


0 <b; <r for0 <i <m,and 


bin # 0. 


The base-r representation of f is written as t = (bmDm—1-+-b2bibo);. 


We will look at base-r where r = 2 (binary), r = 12 (duodecimal), andr = 16 
(hexadecimal). Hexidecimal problems are in Problem 4. 
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1.5.1 Conversion Between Binary and Decimal 


It is helpful to have a systematic way of converting from binary to decimal and 
vice-versa. It is easier going from binary to decimal using a method which looks 
like “synthetic division” or “Horner’s Method” from high school algebra. 


Example 19. Write (10111). in decimal notation. 
Solution. Expressing (10111). using Eq. (1.5.2), 


(10111), = 1-24+0-23?4+1-2'+1-2° 
= 164+4+2+1 
= 23 


The above calculation can be written in the same style as in “synthetic division”. 
Note that in the following table we add each number in a column and then double 
that total and move it to the next column. 


a -i & th a 4 


// 
Exercise 15. Write (10101) and (11110), in decimal notation. 


1.5.2. Conversion from Decimal to Binary 


Example 20. Write the base-10 number 45 in binary notation 


Solution. Take the highest power of 2 that is less than or equal to the given number, 
45. This is 32. Write 45 = 32 + 13. Take the highest power of 2 that is less than 
or equal to 13; this is 8. Write 45 = 32 + 8 + 5. Repeat until we obtain 45 as a 
sum of powers of 2. Thus 45 = 32+ 8+4+41 = 2? 4 23 +2? 4 2°. Hence, 
45 = (101101)p. // 


Alternative Solution There is a different method involving a repeated division 
algorithm which gives a different way of thinking of the solution to Example 20. 


Divide 45 by 2 45 = 22-241 
Divide the previous quotient, 22, by 2 22=11-2+0 
Divide the previous quotient, 11, by 2 11=5-2+4+1 
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Divide the previous quotient, 5, by 2 5=2-:2+1 
Divide the previous quotient, 2, by 2 2=1-2+0 
Stop when the previous quotient is 1 
Obtain the digits for the binary number by reading from bottom up, starting with the 


last quotient, and then the remainder, which gives (101101)2. The above procedure 
can be rewritten as follows: 


ZS 
D2 
2/11...0 
eee 
22d 
is20 


/ 


Exercise 16. A. Write the base-ten number 451 in binary notation. 
B. Write the base-ten number 5001 in binary notation. 


1.5.3. Arithmetic in Binary Systems 


The operation in binary systems is essentially the same as in decimal systems. We 
shall discuss addition, multiplication, subtraction, and division. 
Addition of Binary Numbers 


The addition in base-2 is constructed using the following facts.0+0 = 0,0+1 = 1, 
1+0= 1,and1+ 1 = 10 (that is, write 0 and carry the 1) i.e. 


+]0 1 
0. | 200 
1 1 10 


This means we could say “1 + 1 = 10 because we carry a 1” in base-2! 
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Example 21. Add (10110), and (1010)>. 


Solution. 
1 0 1 1 O 
+ 1 0 1 O 
1 0 0 0 0 0 
In binary notation, (10110). + (1010). = (100000)>. // 
Example 22. Find (101101)2 + (1100)2. 
Solution. 
Pe Qh Ae I {OF = 
+ 1 1 0 O 
1 1 1 0 0 41 
So (101101)2 + (1100). = (111001)>. // 


Exercise 17. Evaluate 


(a) (11011)2 + (11001)2 
(b) (110111)2 + (1100)2 + (110111)2 
Multiplication of Binary Numbers 


The rule for decimal multiplication also holds for the multiplication of binary 
numbers. It is based on the following facts:0-0 = 0,0-1=0,1-0=01-1=1 


x 10 1 
ie. O} 0 O 
1/0 1 


Example 23. Calculate ((101)2)’. 
Solution. We need to write out (101)2 - (101)2. Since (101). = 2? + 1, 


((101)2)” = (2? +12? +1) = 2442-2 41524241 


= (11001)p. // 

Binary multiplication is done this way (called EXPANDED NOTATION) in elemen- 
tary school mathematics. We can rewrite the above computation in more compressed 
form (called the FINAL ALGORITHM) as follows. Note the “carrying” from the third 
column to the fourth. 
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1 0 1 
x 1 0 1 
1 0 1 
0 O O 
+ 1 0 1 
1 1 0 0O 1 


Exercise 18. Evaluate 


(a) (10110)>- (1011)> 
(b) (101011)>-(1011)> 


Subtraction in the Binary System 
Recall that subtraction is the inverse operation of addition. Therefore we have: 0 — 


0 = 0,1-—0= 1,and1—1=0,0—1 = 1 (with borrowing | from the next column 
if possible) i.e. 


SS iOee st 
0 | 0 1 (borrowing | from the next column). 
1 1 0 


Example 24. Evaluate (1101)> — (1011)>. 


0 2 

1 1 
Solution. _ | f Af 1 

0 oO 1 0 
So, (1101)2 — (1011). = (10)o. // 
Example 25. Find (11101). — (10011)>. 

0 2 

1 1 1 
Solution. _ | 0 f f 1 

0 1 0 1 =0 
So, (11001) — (10011)2 = (1010)p. // 


Exercise 19. Evaluate (111101)2 — (110111)2 and (101101). — (1111)o. 


Division in the Binary System 


Division in base-2 will use the same long division procedure as we used in base-10. 
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Example 26. What is (11101)> ~ (101)2? 


Solution. 


101 

101 )11101 
101 

“1001 

101 

~ 100 


Thus (11101). + (101)2 = (101)2 as quotient with remainder (100)o. // 
Exercise 20. Evaluate 


A. (1111112 + (101101). 
B. (111111010010)> = (101101)>. 


1.5.4 Duodecimal Number System 


If b = 12 then the number system is called the DUODECIMAL system. Again the 
digits involved are elements of the set {0, 1,2, 3,4,5, 6,7, 8,9, T, E}. In duodeci- 
mal, T denotes 10 and £ denotes 11. 


Conversion from Decimal to Duodecimal System 
Divide n and each succeeding quotient by 12 until a zero quotient is obtained. The 
sequence of remainders in reverse order yields the duodecimal representation of n. 
Example 27. Let n = 275 in base 10. What is the number of the duodecimal 
representative? 
275 = 22(12)+ 11 
22 = 1(12) + 10 
1=0(12)+1 
0=0(12) +0 


Hence n = 275 = 1-1274+ 10-124+ 11-12° = 1TE. 
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Example 28. Ifn is 160 in base 10, what is it in base 12? 


160 = 13(12) + 4 


13 = 1(12) +1 
1=0(12) +1 
0 = 0(12) +0 


Hence n = 160 = 1-127 + 1-12 +4-12° = 114 (2). 


Conversion from Duodecimal to Decimal System 
Given an integer n with duodecimal representation 
N= (Agax—-1 ...Ao)12, 
its decimal representation can be obtained by computing 
n= 12* a, + (oP Mag +eeet 12! a, +a. 


Example 29. Letn = 114). Thenn = 1-127+1-12'+4 = 14441244 = 16010. 


Example 30. Letn = (3T7)17. Thenn & 3-127+10-12'+7-12° = 432+1204+7 = 
559. 


Exercise 21. Convert the following Decimal numbers to their Duodecimal equiva- 
lents and vice versa. 


i. 523 

ii. 6,234 
iii, (2E4) 12 
iv. (T083)12 


Problem Set 1.5: 


1. Convert the following Binary numbers to Duodecimal numbers and vice versa. 


(a) (1110110)> 
(b) (1010101)> 
(c) (23T)12 
(d) (E5)12 
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. Complete the binary sums and difference without first converting to Decimal 


equivalents. 


(a) (1001)> + (1101)> 
(b) (101). + C111)2 

(c) (10011)> — (1101)> 
(d) (101010)> — (11001), 


. Complete the binary products and divisions without first converting to Decimal 


equivalents. 


(a) (1101)) + (1011)> 
(b) (101)2- (111)2 

(c) (11011). + (11)2 
(d) (1010101) = (101), 


. Another commonly used number system is the Hexadecimal system. This 


system works in base 16 and uses the symbols 
{0,1,...,8,9, A, B,C, D, E, F}. 


(a) Convert the following Decimal numbers to their Hexadecimal equivalents 
and vice versa. 
(1) 309 
(ii) 3471 
(iii) (52)i6 
(iv) (DFE)16 
(v) (ACE123)16 
(b) Complete the Hexadecimal sums and differences without first converting to 
Decimal equivalents. 
(i) (28)16 + (34)16 
(ii) (8C)i16 + (A7)16 
(iii) (34) 16 — (28) 16 
(iv) (349)16— 2F Is 
(v) (FED) 16 = (CBA) 16 


Chapter 2 
Greatest Common Divisors, Diophantine 
Equations, and Combinatorics 


Introduction This chapter deals with the structure of how prime numbers are put 
together (Prime Factorization) and the tools of number theory which include greatest 
common divisors (GCD), least common multiples (LCM). These tools provide the 
beginning of important theorems such as the Fundamental Theorem of Arithmetic 
and the Euclidean Algorithm and its byproduct. Problems which have only solutions 
in integers (Diophantine equations) are discussed. Application to combinatorics is 
also mentioned. 


2.1 GCD and LCM Through the Fundamental 
Theorem of Arithmetic 


Overview In this section, we use the Fundamental Theorem of Arithmetic (prime 
factorization), Theorem 1, to find the greatest common divisor (GCD) and the 
least common multiple (LCM) of two integers. We will first use the unique prime 
factorization to compute the GCD, then define the least common multiple (LCM) of 
two integers and give two ways of computing the LCM. One will be through prime 
factorization, one will be through its relationship to the GCD, and the third will be 
an algorithm similar to that for calculating the GCD. This approach is the one with 
which you are probably familiar, as it is a part of the mathematics of elementary 
school. It is covered in the first semester of a content based math course for future 
elementary and middle school teachers. 

We continue in this section with the idea of divisibility, but instead of asking 
whether a given integer a is divisible by the integer d, we look at whether two 
integers a and b are both divisible by d. We are especially interested in the largest 
integer which is a divisor of a and also a divisor of b. That integer will be called the 
GREATEST COMMON DIVISOR of a and BD. It proves very useful in problem solving, 
showing that there are real numbers that aren’t fractions (are irrational.) 
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The formal definition of this concept and its first applications are the contents of 
this section. In the next section we will then use the Division Algorithm to find a 
very efficient method (the Euclidean Algorithm) to find the greatest common divisor 
of two numbers. 


Definition. Let a and b be integers neither of which is zero. If c € N, c divides 
a and c divides b then c is called a COMMON DIVISOR of a and b. A positive 
integer, d, is the GREATEST COMMON DIVISOR of a and b, written d = GCD(a, b), 
provided that 


1. d is acommon divisor of a and b and 
2. if c is any common divisor of a and b, thenc < d. 


If n € Z then the set of positive DIVISORS of n is the set of elements 
d(n) = {c € N | c divides n}. 
Note that while a divisor of n may be negative, the greatest common positive divisor 
of two numbers must be positive by definition. 
Example 31. What is the greatest common divisor of 24 and 60? 


Solution. We collect the positive divisors of 24 and 60 in the sets: 
d(24) = {1,2,3,4, 6, 8, 12, 24} 
and 
d(60) = {1,2,3,4,5, 6, 10, 12, 15, 20, 30, 60}. 


The positive common divisors of 24 and 60 is the set of elements in both d(24) and 
d(60); that is, d(24) N d(60). Since 


d(24) N d(60) = {1,2, 3,4, 6, 12} 


and the largest common element of these is 12, the GCD(24, 60) is 12. // 
Exercise 22. Find the greatest common divisor of 36 and 118. 


The way in which we found GCD(24, 30) is called the method of exhaustion. 
While it is not efficient in finding the GCD of two numbers (for example, try to find 
the GCD(136, 62) with this method and compare with your efforts in Example 38 
when we get there), it does show that there is a GCD(a, b) for any numbers a and b. 
Here is the reasoning: Continuing to use the notation d(n) for the set of divisors 
of n, we note that both d(a) and d(b) are finite sets and they have the integer 1 
in common. Therefore their intersection has a largest element. This element is the 
greatest common divisor of a and b and is at least 1. Symbolically, 


1 < GCD(a, b) = largest element of (d(a) N d(b)) < minimum{|a], |b]}. 
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Let’s look again at Example 31 but see what happens if we use the Fundamental 
Theorem of Arithmetic to factor both 24 and 60 as the product of primes. 


Example 32. Using prime factorization, find the GCD(24, 60). 


Solution. We note that 
24 = 23.3 and 60 = 27-3-5 (2.1.1) 


where we have purposely written the primes in increasing order. Looking at the 
prime factorization, Eq. (2.1.1), we see that the largest integer which divides both is 


d = 2.3 = 12 = GCD(24, 60). 7 

Thus we have found the GCD(24, 60) by factoring both numbers into primes and 
then seeing which primes are in both the factorizations. In this case, they are 2 and 
3 but not 5. We then see what is the largest exponent of these primes that appears 
in each of them (exponent of 2 for the prime 2 and 1 for the prime 3). The same 
procedure always works, although the notation gets to be complicated as we see in 
Theorem 3. 


Theorem 3 (GCD via Prime Factorization). Let a and b be positive integers and 
suppose that the prime factors of a are p\,... px and the prime factors of b are 
qi,---q. We then write 

=P) 8 P, One Gig 
where each of the exponents is positive. If the primes are re-arranged so that the 
first v of them are equal (pj = q\,... Pv = qv), then the GCD(a,b)= zB 2. pe 
where t; is the minimum of r; and 5;. 


To show the theorem, we need the following lemma. 


ny nz 


Lemma 1. Let n = p}'p;?--- p;' with nj > 0 for each i and let m be a positive 


integer. Then m | n if and only if m = p}'' p;? +++ p;"' with m; <n; for each i. 


Proof. lf m = pj"! p3? +++ p;"’ with m; <n; for eachi, then 
M= pylers pe = (pyr ees pe) (py s+ py). (2.1.2) 
Since n; > m;, we have pj! "!-+- p;!"" > Landm | n. 


Conversely if m | n, then n = mk for some integer k. Since n = pj! --- p7' 


and the prime power decomposition of n is unique (Corollary 1), we have the prime 
power decomposition of m and k must be the same as those in n. Therefore, we 
have 


m= py'+++p; andk = pi ep. Mees Oe =o. (2.1.3) 


Since n = mk, we haven; = m; + k; and then n; > m;. | 
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Now we are ready to prove Theorem 3. 


Proof. It suffices to show that if a = p}'p,---p% and b = pj'py---p, 


then GCD(a,b) = pj'p?--: pl’ where ¢; is the minimum of r; and s;. Let 
d= at pp pe. Since ¢; is the minimum of r; and s;, t; < r; andt; < 5;. 


Therefore, by the above lemma, d | a andd | b. Suppose now g | a and g | b. Then 
by the lemma above again, g = oe -++ pS with g; < s; and g; < r; for eachi. 
But since ¢; is the minimum of s; and 7;, this implies that g; < ¢;. Therefore, again 
by the above lemma, g | d. Thus, d = GCD(a, db). (- 


Remark. The above lemma provides an easy way to find all the positive divisors of 
a positive integer once its prime power decomposition is obtained. 


Example 33. Find all positive divisors of 108. 


Solution. Since 108 = 2? x 3%, the divisors of 108 are 


or 
1-1 1-2 1 
3-1 3-2 3-2? 
9? «1 37-237 +2? 
Ta US ae a 
// 
Example 34. Find the GCD(504, 3675) using the prime factorization method. 


Solution. It is easy to find that 
504 = 23. 37-7 and 3675 = 3! - 57.7. 


Thus, since the only primes that are in both numbers are 3 and 7 and each has 
exponent | in one of the numbers, the theorem says that 


GCD(504, 3675) = 3-7 = 21. i 

Recall that an integer greater than | is prime if its only positive divisors are itself 

and one. Of course, a prime number p is also divisible by — p and —1. By analogy, a 

very appealing situation will be when, given two integers, the only common positive 

divisor they have is 1. We formalize that concept in the next definition which will 
be quite important for the remainder of this book. 


Definition. The integers a and b are RELATIVELY PRIME if the GCD(a, b) = 1. 
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Being relatively prime is one extreme relationship for a pair of integers. The next 
exercise is concerned with another extreme relationship. 


Exercise 23. If a,b € Z and GCD(a, b) = b, what is the relationship between a 
and b? (Prove that your answer is correct.) 


We now define rational and irrational numbers and apply the ideas above to show 
that the a/ 2: /3 and V5 are irrational (in fact, the same logic can show that ,/p is 
irrational if p is a prime number). This is another example of problem solving using 
an established foundation (in this case, the concept of a GCD.) 


Definition. If x is a real number, x = * where r,s € Zand s # 0, then x is called 
a RATIONAL NUMBER. We Say x is an IRRATIONAL NUMBER if it is not a rational 
number. 


Proposition 3 (Reduction to Lowest Terms). Jf x is a non-zero rational number, 


then x can be written as 


i= where a and b are relatively prime integers. (2.1.4) 


Proof. Suppose x is rational so that x = = forr,s € Z. Letd = GCD(r,s). Then 


If we leta = 7 and b = * then both are integers, since d is a divisor of both and 
are relatively prime (why?). Thus, we have written the integer x as in Eq. (2.1.4). 


The ability to represent any rational integer as the quotient of two relatively 
prime integers (“reduce to lowest terms’’), as in (2.1.4), allows us to prove results 
by the technique called “proof by contradiction”. The following theorem uses this 
technique as do Theorem 9 and the Rational Root Theorem (Theorem 20). 


Theorem 4. The number V2 is an irrational number. 


Proof. To prove that </2 is irrational, we will assume the opposite (i.e. that /2 is 
rational) and show that assumption leads to a contradiction and so is incorrect. Since 
the assumption that ./2 is rational is false, it must be that V2 is irrational, which is 
what we are trying to prove. This technique is called “proof by contradiction’. 

Assume for the moment that the /2 is rational and so it is the quotient of two 
integers. Thanks to (2.1.4), we may assume that 


A= 7 where GCD(a,b) = 1. (2.1.5) 


This equation means that a = b/2 or 


2 = 25? (2.1.6) 
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and so a? is even. If a were odd (say a = 2 + 1), its square (407 + 44+ 1 = 
2(207 + 2£) + 1) would also be odd. Therefore, a is even which means that a = 2k 
for some k € N. Putting a = 2k into (2.1.6) yields 


Ak? = 257. 


Thus b?, and hence b, is even by the same argument. Since both a and b are even, 
each is divisible by 2. Thus, GCD(a,b) > 2, which contradicts our assumption 
({Eq. (2.1.5)] and the proof is finished. | 


We now turn our attention briefly to the concept of least common multiple (LCM) 
of two integers. While the GCD will be much used in the remainder of this chapter, 
we will treat the LCM briefly as it will not play as much of a part in the remainder 
of this text. However, both the GCD and the LCM are very much a part of the 
mathematics of elementary schools. 


Definition. The LEAST COMMON MULTIPLE of two integers a and b, LCM(a, b), 
is the smallest positive integer that is a multiple of both a and b. 


We can recast this definition to be similar to the definition of GCD(a,b) by 
saying that / = LCM (a, b) if 
1. / isa multiple of a and of b 
2. if k is any other multiple of a and b then/ < k. 


If n € Z then the POSITIVE MULTIPLES of 7 are collected in the set: 
m(n) = {c € N|n divides c}. 


The LCM (a, b) is the smallest integer in m(a) N m(b). 

Note that all elements of m(a) and m(b) are positive and if a and b are either 
both positive or both negative, then m(a) and m(b) share at least the element ab 
and thus there is a LCM(a, b)—that is, m(a) N m(b) # 9. 


Example 35. Find the least common multiple of 270 and 630. 
Solution. We will use the prime factorization of 270 and 630 to find the 
LCM (270, 630). It is easy to show that 

270 = 2-3°-5 and 630 = 2-3°-5-7. 


Any multiple of both 270 and 630 must have in its prime factorization at least 
one 2, three 3s, one 5, and one 7 in it. Thus, the smallest multiple of both numbers 
must be 


LCM (270, 630) = 2-33-5-7 = 1890. P 
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It isn’t difficult, from looking at this example to prove the following theorem 
which is analogous to Theorem 3 which uses prime factorization to find the GCD 
of two integers. Once again, the difficult part of the proof is keeping track of the 
notation. 


Theorem 5 (LCM via Unique Prime Factorization). Let a and b be positive 
integers and suppose that the prime factors of a are p, ... px and the prime factors 
of b are q, ...qi. We then write 


CP ih ANOS G, oct, 


where each of the exponents is positive. If the first v of the primes are equal (p, = 
41,-+» Pv = qv) and t; is the maximum ofr; ands; for 1 < j < v then 
LOME DS PF isc P 0 oo Gy es 
We will not include a proof of the fact but ask the reader to supply one for a 
number of cases in which the notation is easier. 


Proposition 4. [fa and b are non-zero integers and d = GCD(a, b) then there are 
relatively prime integers h and k with 


a = dhand b = dk (2.1.7) 


Proof. Since d is the GCD of a and b, d|a and d |b, so that there are integers h and 
k which satisfy Eq. (2.1.7). We will now show that, in fact, # and k are relatively 
prime. 
Define r = GCD(h, k). If r = 1, then, by definition, h and k are relatively prime 
and we are done. Note that A and x are integers and a = (rd) (*) and b = (rd) (£). 
If r > 1, then rd > d, rdla, and rd|b which is a contradiction since d = 
GCD(a, b). Therefore r = 1, so A and k are relatively prime. | 


Theorem 6. For any non-zero integers a and b, 
GCD(a, b)LCM(a, b) = ab. 


Proof. Let GCD(a,b) = d. Then there exist nonzero positive integers u and v 
where a = du and b = dv and GCD(u, v) = 1. Then we see that LCM (a, b) = uvd 
and: 


GCD(a, b)-LCM(a, b) = d - (uvd) = (du)(dv) = ab. 


Theorem 6 provides a direct link between the GCD and LCM of two arbitrary 
positive integers. Thus given either the GCD or LCM, one is able to find the other 
by a simple division of integers. However there is a method which simultaneously 
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determines the GCD and LCM of two positive integers using unique factorization; 
we call the following method “simultaneously determining GCD and LCM”. 
Example 36. Determine LCM(40, 155) given that the GCD(40, 155) = 5. 


Solution. By Theorem 6, we see that: 


40x 155 6200 
GCD(40, 155) 5 


LCM (40, 155) = = 1240=5x8x3l. 


If we do not assume that the GCD(40, 155) = 5, we can still determine the LCM 
quickly. We notice from the divisibility tests that 5 is a common divisor of 40 and 
155. We create the following table displays the quotients of dividing both integers 
by 5. 


5|40 155 
8 31 


Since 8 and 31 are relatively prime, we see that 5 = GCD(40, 155). Also we can 
read the LCM off of the table by multiplying the divisor by the relatively prime 
quotients. Hence LCM (40, 155) = 5 x 8 x 31. // 


This process can be repeated in the case we do not guess the GCD for our 
first divisor. Recall Example 34 where we found GCD(504, 3675) = 21. We 
now produce an alternative solution to this example using the “simultaneously 
determining GCD and LCM” method. 


Solution (Alternative Solution to Example 34). Using the divisibility tests we 
first recognize that 3 is a common divisor to 504 and 3675. After division by 3 we 
recognize that 7 is the smallest prime that divides the quotients. 


504 3675 
168 1225 
24 175 


|~1| & 


Since 24 and 175 are relatively prime, we have determined GCD(504, 3675) = 21. 
We have additionally determined: 


LCM (504, 3675) = 3x 7x 24x 175 = 2? 3? x 5° x 7 = 88200. 
Exercise 24. Using the “simultaneously determining GCD and LCM” method, 


determine the LCM of 270 and 630. 


The remainder of this section is needed only in an exercise in Sect. 2.3 but not 
in the material of this text. It indicates how to extend these ideas to more than two 
integers. 
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Definition. If a, b, c are integers (not all zero) then the GREATEST COMMON 
DIVISOR of a, b, and c is the largest integer which divides all three. We will write 
GCD(a, b,c) for the greatest common divisor. 


Example 37. What is the GCD(12, 18, 30)? 


Solution. Since the numbers 12, 18, and 30 are relatively small we can easily write 
down all of their divisors individually. A good way to organize this information is 
in the form of a chart. 


Number | Divisors 
12 1, 2, 3, 4, 6, 12 
18 1, 2, 3, 6, 9, 18 
30 1, 2, 3, 5, 6, 10, 15, 30 
From the chart, we see that GCD(12, 18, 30) = 6. // 


Exercise 25. Prove that for any integers a, b, c not all zero, GCD(a, b, c) exists. 


Remark. Unfortunately the GCD and LCM of three positive integers a, b, c do not 
behave as nicely as the case we considered only two integers. To be specific, in 
general: 


GCD(a, b,c)LCM(a, b,c) £ abc. 
Exercise 26. Find an example of three distinct integers a, b, c such that 


GCD(a, b,c) -LCM(a, b,c) € abc. 


Problem Set 2.1: 


1. Find the GCD(14,22) by both the method of exhaustion and via prime 
factorization. 

2. Find the GCD(600, 28000) by both the method of exhaustion and via prime 

factorization. Which is easier? 

. Find the LCM(14, 22) via prime factorization. 

. Find the LCM(600, 28000) via prime factorization. 

5. Show that GCD(a, b)LCM(a, b) = ab in the cases below (see Problems 1-4 
above). 


(a) a= 14andb = 22 
(b) a = 600 and b = 28000 


Rw 


. Prove that 3 is irrational. (Hint: Look at the proof of Theorem 4.) 

. Prove that 5 is irrational. 

. If is a rational number, prove that r 4/5 is irrational. 

. Ifr is an irrational number, is it necessarily true that rJ/5 is irrational? If this is 
a true statement, then prove it. If it is false provide a counter example. 
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Let n be between 1000 and 1200 inclusively. Assume that if n is divided by 
7,9, 11, the remainders are 4, 6, 8 respectively. What is the smallest value of 1? 
Let n be between 9000 and 10,000 inclusively. Assume that if 1 is divided by 
5, 9, 13, or 17, the remainders are all 3. What is the smallest value of n? 

John goes to the library every 2 days and Linda goes to the library every 3 days. 
They met at the library last Sunday. When will they meet again on a Sunday? 
Every 5 days Hayden eats steak at the town diner. Lucy eats at the same diner 
every 2 days and if she goes on a Sunday, she brings her friend Janice. If 
Hayden, Lucy, and Janice all ate at the dinner on Sunday, January 1, when will 
be the next date all three of them eat there again? 

Suppose for positive integers a and b, we know ab = 1200 and GCD 
(a,b) = 10. 


(a) Ifa > b, what are the possible combinations for a and b? 
(b) For each combination found in Part a, determine LCM (a, b). 


Suppose that there is only one prime in the factorization of a and only one in 
the factorization of b so that 


a= p* andb=q’ (2.1.8) 


where p and qg may be different. 


(a) According to Theorems 3 and 5, what is the GCD(a, b) and the LCM(a, b)? 
(You will need to separate out two cases: p = g and p # q.) 
(b) Prove Theorems 3 and 5 for the special case of Eq. (2.1.8). 


Suppose that there are two primes in the factorization of a and two primes in 
the factorization of b and all four primes are different. 


(a) According to Theorems 3 and 5, what is the GCD(a, b) and LCM(a, b)? 
(b) Prove Theorems 3 and 5 for this special case. 


Determine the GCD and LCM of 150 and 365 first using prime factorization 
and then using the “simultaneously determining GCD and LCM” method. 
Determine the GCD and LCM of 1050 and 3276 using the “simultaneously 
determining GCD and LCM” method. 

There are 48 oranges, 36 mangos and 60 pears to be distributed in boxes. The 
distribution mandates that each box contains an equal amount of the same fruit. 
What is the maximum number of boxes needed and how many oranges, mangos 
and pears are in each box? 

Let a,b, 41, 92,93 € N and let 


a = bq, + 4098 
b = 4098q2 + 582 
4098 = 582q3 + 24. 


What is GCD(a, b)? (Answer: 6) 
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2.2 GCD, the Euclidean Algorithm and Its Byproducts 


Overview We now turn to the use of the Division Algorithm (see Sect. 1.1), which 
allows us a more efficient way of finding the GCD. The method about to be 
presented was discovered by the Greek mathematician Euclid about 2,000 years ago 
and is called the Euclidean Algorithm. We first prove the theorem and then show 
its usefulness through some examples. 

Note that a and —a have the same set of divisors for any integer a. Thus, in 
computing GCD(a, b) we can assume that a and b are nonnegative as GCD(a, b) = 
GCD((a|, |5]). 


Theorem 7 (Euclidean Algorithm). Given a,b € N where b > a anda + b, 
apply the division algorithm successively as follows: 


b=qa+t+n,0<r<a 
a=Qrit+nm0<nm<rn 


ry = g3r2 + 73,0 < 73 < ro 


ln—2 = Qnln-1 + Tn, 9 <Tn <Tn-1 
Tn-1 = Qn+1ln +0 


Then r, = GCD(a, b). That is, the last non-zero remainder that appears in this 
manner is equal to GCD(a, b). 


The proof of this theorem is based on the following lemma: 


Lemma 2. Let a,b € Z, and assume that a > 0 and a does not divide b. Let q and 
r be the quotient and remainder respectively of b when divided by a as guaranteed 
by the Division Algorithm. Then 


b=qa+r,0<r<a (2.2.1) 
and GCD(a, b) = GCD(a,r). 


Proof. Equation (2.2.1) is just the statement of the Division Algorithm so we only 
need to show that GCD(a, b) = GCD(a,r). Let d = GCD(a, b). To prove that d is 
also equal to the GCD(a, r), in accordance with the definition of the GCD, we must 
prove two things: 


(i) d is a divisor of both r and a 
(ii) d is the largest common divisor of r and a. (2.2.2) 
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We first prove Part (i). Now, d = GCD(a, b) means that d divides b and d divides a. 
Thus Eq. (2.2.1) shows that d also divides r since 


d|\(b — qa) =r. 


Since d is already a divisor of a, d is a common divisor of r and a, which is what 
Part (i) requires. 7 _ 

To prove Part (ii), we let d be any common divisor of r and a and show that d is 
no larger than d. Certainly, since d is a divisor of r and a, 


d|r and dja. (2.2.3) 


We now show that d <dza= GCD(a, b) to establish statement (2.2.2). However, 
Eqs. (2.2.1) and (2.2.3) show that d must divide b since 


d\(qa+r)=b 
and d|a. Therefore 
d < GCD(a,b) = d, 
which means that d is no larger than d. Hence Part (ii) [statement (2.2.2)] is 
established. 


We are now ready to prove the theorem. 


Proof. Using this lmma we have GCD(a, b)= GCD(a, r})=... = GCD(1n-1, Tn) 
= GCD(r,, 0) = rp. |_| 


Example 38. Find the greatest common divisor of 136 and 62 using the Euclidean 
algorithm. 


Solution. Since 136 = 2(62) + 12, the remainder of 136 on division by 62 is 12. 
We can apply the Euclidean Algorithm and obtain 
GCD(136, 62) = GCD(62, 12). 


But 62 = 5(12) + 2 so another application gives (since 2 is the remainder of 62 on 
division by 12), 


GCD(136, 62) = GCD(62, 12) = GCD(12, 2) = 2. ‘i 


The process of Example 38 can be displayed (with the circled numbers being the 
multipliers and subtraction made in the vertical columns) by the chart below. 
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136 @) 62 


The last line gives the GCD(12,2) = 2 by inspection since 2 divides 12. Note 
that the GCD is in a box, not a circle. 


Exercise 27. Find the GCD(118,36) using the Euclidean Algorithm as in 
Example 38. 


From the Euclidean Algorithm (Theorem 7) we next gain a wonderful byproduct 
(Theorem 8). We will first look at some examples which depend on the important 
observation that the remainder of a on division by } can be written as an integer, 
x, times a plus another integer, y, times b. This phenomenon is called expressing 
r as an integer linear combination of a and b. In Example 38, r = 2,a = 136, 
b = 62, and 2 = 62x + 136y where x = 11 and y = —5. How to obtain x = 11 
and y = —5 is difficult. Right now we are only asking you to do arithmetic so that 
2 = 62(11) + 136(—5). 

We now carefully define the necessary terminology to develop another method 
for solving GCD problems, mainly backwards substitution. We will repeat 
Example 38 to show how to obtain x = 11 and y = —5 and along with Example 40 
supports this method before stating and proving in a special case the Euclidean 
Algorithm Byproduct. 


Definition. Let a,b € Z. r is an INTEGRAL LINEAR COMBINATION of a and b if 
there are integers x and y such that 


r=ax+by. (2.2.4) 


We will usually just write that r is a linear combination of a and b unless there is 
some ambiguity. 

Example 39. Recall from Example 38 that GCD(136,62) = 2. Using some 
arithmetic insight (guessing) we see 


2 = 62(11) + 136(—5) 


which shows Eq. (2.2.4) to be true in this case with x = 11 and y = —5 and 
r = GCD(136, 62) = 2. Thus, we are able to show that 2 = GCD(136, 62) is a 
linear combination of 136 and 62 because we were lucky and realized that 


2 = 62x + 136y if x = ll andy = —5. (2.2.5) 


54 2 Greatest Common Divisors, Diophantine Equations, and Combinatorics 


Showing that 2 = 62x + 136y where x = 11 and y = —5 is all well and good 
(and mathematically correct), but how were we led to “guess” x = 11 and y = —5? 
Our method (which will motivate a proof of the Euclidean Algorithm Byproduct) is 
as follows. 

Since 136 = 2(62) + 12, we have that 


136 — 2(62) = 12. (2.2.6) 
But 62 = 5(12) + 2 means that 
62 — 5(12) = 2. (2.2.7) 


We now go “backwards.” Putting (2.2.6) into (2.2.7) gives, with a = 62 and 
b = 136, 
2 = 62 — 5(136 — 2(62)) = a — 5(b — 2a) 
= lla—5b. 
Thus 2 = (62)11 + 136(—5), as promised, and the GCD(a, b) is indeed a linear 
combination of a and b. Using the Euclidean Algorithm Byproduct will mean that 


we can obtain the linear combination of (2.2.6) with x = 11 and y = —5 without 
guessing. 


Here’s another example which gives insight into how the proof of Theorem 8 
should work. 


Example 40 (Backwards Substitution). Let b = 123 and a = 36. Find the 
GCD(123, 36) and show that it is a linear combination of 123 and 36. 


Solution. The Division Algorithm gives: 


123 = 3(36) + 15 (A) 
36 = 2(15) + 6 (B) 
15 = 2(6) +3 (C) 

6 = 2(3) +[0] (D) 


so that using the Euclidean Algorithm, Theorem 7, 
GCD(123, 36) = GCD(36, 15) = GCD(15, 6) = GCD(6, 3) = 3. 
The answer for the first part of this example is that GCD(123, 36) = 3. 


However, now look at what the chain of substitutions from Eqs.(A), (B) 
and (C) gives! We will now show that these three equations give the “integral 
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linear combination” result which appears is the Euclidean Algorithm Byproduct, 
Theorem 8. A simple “moving to the other side” process on (A), (B) and (C) yields: 


123 — 3(36) = 15 orb —3a = 15 (A’) 
36 — 2(15) = 6 (B’) 
15 — 2(6) =3. (C’) 


Starting at (C’) and using (B’) with b = 123, a = 36 gives 


GCD(123, 36) = 3 = 15 — 2(36 — 2(15)) 
= 15 —2a + 4(15) = 5(15) —2a. 


Now, continuing going backwards up to (A’) to replace 15 by 123—3(36) = b—3a, 
gives: 


3 = 5(123 — 3(36)) — 2a = 5b — 15a — 2a. 
Therefore: 
GCD(a, b) = GCD(123, 36) = 3 = 5b + (-17)a 


and so the GCD(a, D) is a linear combination of a and b. // 


In number theory, the process of Example 40 is (sometimes) called BACKWARDS 
SUBSTITUTION. Look at the pattern in Eqs. (A), (B), (C), and (D)—the three 15’s 
run diagonally, then the three 6’s, then the two 3’s and note that 15, 6, and 3 are 
remainders. 

Backwards substitution can be displayed in a tableau (TABLEAU METHOD) which 
is more effective than the classical way. The construction of the table below ends 
with the GCD(a, b) expressed as an integral combination of a and b as we see in 
Example 41. 


Example 41 (Tableau Method). Write the GCD(123, 36) as a linear combination of 
123 and 36 using a tableau method. 


Solution. We will keep track of the operations by writing a table (“tableau’’) 
which follows closely Eqs.(A), (B), (C), and (D) above and looks like the table 
of Example 40. We are writing b = 123 anda = 36. Note that the integer below 
the line is the difference of the two above and is Eq. (A). Similarly, the next vertical 
triple is (B) and the third is (C). 
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b 123 


3a 108 oo 
b-3a 15 ee 36 a 
30 2(b-3a) 


b-3a 15 6 a-(2(b-3a))=7a-2b 
2(7a-2b) 12 eae 
b-3a-2(7a-2b) Bl 6 =5b-17a 


Thus the GCD(a, b) = 3 = 5b — 17a. As a check, 3 = 5(123) — 17(36). // 


Example 42. Use the classical and Tableau methods for finding the GCD(13, 5) as 
a linear combination of 13 and 5. 


Solution. Classically, using the Euclidean Algorithm three times, one can write 


13=2-543 
5=1-342 
3=1-2+4+1 
2=2-1 


and then back substitute. Then we obtain: 


1=3-1-2 
2=5-1-3 
3=13-2-5 


Therefore 1 = 3—1(5—1-3) = 3—5+4+1-3 = 2-3—5 = 2(13—2-5)—5 = 2-13—5.5. 
Note that the GCD(13, 5) = 1 from the last line. 
The tableau method, with b = 13 and a = 5, can be constructed from the above 
equations as: 


b 13 5 a 
Qa 10 eee 
b-2a 3 —~Q a 


b-2a 


Nw] w ot 


b-2a 3 a-(b-2a)=3a-b 
3a-b 2 poe al 


2b-5a 1 


Thus 1 = 2b — 5a which solves GCD(13, 5) = 1 = 13x + 5y with x = 2 and 
y=—5. / 
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Theorem 8 (Euclidean Algorithm Byproduct). [fa and b are non-zero integers 
then the GCD(a, b) is an integral linear combination of a and b. 


We will now give a proof of Theorem 8 (Euclidean Algorithm Byproduct) in 
the case when we need four iterations of the Euclidean Algorithm, which was the 
situation sketched out in Example 40. A general proof follows in exactly the same 
manner but the notation gets a bit awkward during the backwards substitution. 


Proof (Sketch). Given a,b € Z, assume that the GCD(a,b) is known and for 
simplicity assume b > a > 0. We use the Euclidean Algorithm to obtain quotients 
qi. 42, 43 and q4 and remainders 71, r2, r3 (respectively) in its fourth step (again, this 
assumption is to simplify the notation, although the same ideas work in the general 
case). Thus: 


b=qatn O0<r, <a (E) 
a= Qrit+ro 0<m<r (F) 
ry = gro + r3 0<1r3<ro (G) 
ro = qar3 + 0. (H) 


Note again the diagonal and vertical pattern with the remainders 1, r2, and r3. We 
know that 


GCD(a, b) = GCD(a, r}) = GCD(r, r2) = GCD(r2, r3) = 73. 


We now rewrite Eqs. (E), (F), and (G) (mimicking Example 40) 


b-qa=r (E’) 
a— qr, =r (F’) 
| — Q3r2 = 13. (G’) 


Then starting with (G’) we perform the backwards substitution, first with (F’): 
GCD(a,b) = r3 = 1 — garo = 11 — q3(4— Qari) 
= (1+ 93q2)ri — qaa. 
Now using (E’), we complete the backwards substitution: 
GCD(a, b) = (1 + 93q2)(b — qia) — q3a 
= (1 + 93q2)b + (—q1 — 93 — 919293). 
Therefore 


GCD(a, b) = xa+ yb 
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where x and y are the integers given by 


y = (1 + qoq3) and x = —(qi + 93 + 919293). 


Thus, the GCD(a, b) can be written as xa + yb where x, y € Z are as in the last 
equation. This linear combination is the conclusion of the Euclidean Byproduct 
Theorem in the special case when the process stops in its fourth step. a 


Exercise 28. Write the GCD(118, 36) as an integral linear combination of 36 and 
118 (see Exercise 27) by mimicking the proof of Theorem 8. 


Given a,b € Z, the use of Theorem 8 as a method to find integers x and y such 
that the GCD(a,b) = ax + by can be cumbersome, as we saw in Example 40. 
Furthermore, there are many possible integer coefficients x and y that work. For 
example, if xo and yo is a solution of 


GCD(a, b) = xoa + yob 


then, for any tf € Z, x = xo + bt and y = yo — at are also solutions of Eq. (2.2.4) 
(See Problem 5 of Problem Set 2.2). We will see more of the characteristics of other 
solutions in the next section. 


Definition. Let a,b € Z be given. If xo and yo are integers such that 
GCD(a, b) = xoa + yob 


then (xo, Yo) is called a PARTICULAR SOLUTION of the integral linear combination. 
A GENERAL SOLUTION of the integral linear combination of a and b is 


{(x, vy) | x = x9 + bt and y = yo — at for some t € Z}. 


The notion of particular and general solutions will play a major role in the next 
section, Diophantine Equations, and will allow us to solve many problems which 
require integer solutions. 

Here is another technique for finding a particular solution of the integral linear 
combination. 


Example 43. Find a particular solution and general solution of the integral linear 
combination for the GCD(611, 235). 


Solution. Some computation shows that GCD(611, 235) = 47. We are looking for 
integers x and y such that 


611x + 235y = 47, 
which can be reduced to 


13x+5y=1. (2.2.8) 
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ne _ =10x Sel = 


Thus y = —2x + Set . We must find x and y, both 
integers, such that Eq. (2.2. 8) i is satisfied. Guessing in Eq. (2.2.8) is easier than its 
predecessor above. First, we pick a value of x such that the second term — a is an 
integer. We see x = 2 certainly works, which yields y = —4 + (—1) = —5 and so 
(2, —5) is a particular solution of 611x + 235y = 47. (There are, of course, other 
choices for x which will work.) 


The set of general solutions is, in the case of x» = 2 and yp = —5 


{(x, y) |x = 2+ 235t, y = —5 — 6111 for some t € Z}. 
i/ 


Exercise 29. Find a particular solution for an integral linear combination which 
equals GCD(7, 19) by using the technique of the previous example. 


Recall from the previous section that if a,b € Z, and neither are zero, then a and 
b are defined to be relatively prime if they have no common positive divisor other 
than 1. Put more succinctly, GC D(a, b) = 1. The following result, which follows 
from the Euclidean Algorithm Byproduct, is left as Problem 11 of Problem Set 2.2. 


Proposition 5. Let a and b be positive integers. Then a and b are relatively prime 
if and only if 1 is an integral linear combination of a and b. 


Example 44. Find all positive integers a and b such that 
a+b = 156and GCD«(a,b) = 26. 


Solution. Since the GCD(a, b) = 26, Proposition 3 (page 45) shows that there are 
relatively prime, positive integers h and k with 


a = 26h and b = 26k. 


Since a + b = 156, we have that 


156 
a 
= 26 


Therefore, as h and k are relatively prime, we see by examination the only 
possibility is that one of them is | and the other 5. So the only solutions are a = 26 
and b = 130 (orb = 26 anda = 130). // 


Before you look at the next exercise, which you should do in a manner similar to 
Example 44, think about the following problem: 


“Find two integers, whose sum is 84, both of which are divisible by 7 but not by any larger 
integer.” 


Without the concepts of GCD and the Euclidean Algorithm Byproduct (Theorem 8), 
this would be a very difficult problem. However, because we have established 
machinery (a foundation), you will find the problem solving easy (Exercise 30). 
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Exercise 30. Find all positive integers a and b such that 


a+b = 84and GCD(a,b) = 7. 


Proposition 6. [fa and b are relatively prime and a|bc then alc. 


Proof. If c = 0, there is nothing to prove so we may assume that c # 0. a and 
b are relatively prime which means that the GCD(a, b) = 1. Using the Euclidean 
Algorithm Byproduct, we may therefore express | as a linear combination of a and 
b. That is, there are integers x and y with 


1=xa+ yb. (2.2.9) 
Because a|bc, there is an integer k such that 
ka = be. 
Equation (2.2.9), solved for b, 
b = (1—xa)/y 


can be substituted into ka = bc to give: 


or (kya =c—(xc)a. 


Thus (ky + xc)a = c and so a divides c as (ky + xc) is an integer. a 
Corollary 3. If m|a and n\|a with GCD(m,n) = 1, then mn|a. 


Proof. Since m|a, a = mq for some q € Z. Since nla, n|mq, and therefore n|q 
since GCD(m,n) = 1. Therefore we have that q = nt for some t € Z, and so 
a=mnt => mala. i 


Exercise 31. Give an example of positive integers a, b and c such that a|bc but 
a does not divide either b or c. (This exercise shows that the hypotheses that 
GCD(a, b) = 1 cannot be dropped in Proposition 6.) 


Problem Set 2.2: 


Assume throughout this problems set that neither a nor b are zero. 
1. Find the GCD(46, 69) using the Euclidean Algorithm, and use the method of 


backwards substitution to express it as a linear combination of 46 and 69. 

2. Find the GCD(46, 69) by the tableau method and write it as a linear combination 
of 46 and 69. 

3. Use the Tableau Method to write GCD(136, 62) as a linear combination of 136 
and 62. Compare your answer to Example 39. 


2:3 


oo 


11. 
12. 


13. 
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. Suppose that a and b are integers and that 1 is a linear combination of a and b. 


Prove that a and b are relatively prime. Compare this result with Theorem 8. 


. If Xo, yo € Zis a particular solution for GCD(a,b), t € Z and if x = xo + bt 


and y = yo — aft then GCD(a,b) = xa + yb. (This shows that every general 
solution actually gives the GCD(a, b) as an integer linear of a and D.) 


. (a) If a and b are relatively prime, a is odd, and b is even, prove that (a + b) 


and (a — b) are relatively prime. 

(b) If a and b are both even, prove they are not relatively prime. 

(c) If a and b are both even or both odd and a ¥ b prove that (a + b) and 
(a — b) are never relatively prime. 


. Find all positive integers a and b such that a + b = 15 and GCD(a, b) = 3. 
. Find all positive integers a and b such that a + b = 132 and GCD(a, b) = 11. 
. Assume that a and b are both positive integers and xp and yo are integers with 


GCD(a, b) = xpa+ yob. Prove that, for any t € Z, x = x9 +btand y = yo—at 
also gives the GCD(a, b) as an integral linear combination of a and b. (That 
means you must prove that GCD(a, b) = xa + yb.) 


. Find two integers whose sum is 192 both of which are divisible by 32 but not 


both divisible by any larger integer. 
Prove Proposition 5 and compare with Problem 4. 
(a) If k is a positive integer, prove that k and (k + 1) are relatively prime. 
(b) Let p be a prime number. 
(1) (experiment) Pick you favorite prime number p and for some examples 
(such asn = 1,2, up to 15), see when n + p and 7 are relatively prime. 
(2) (conjecture) What do you think is true? (There should be a pattern in 
your answer to #1.) Hint: Your conjecture should take the form “if p is 
a prime number and ????, then + p and 7 are relatively prime.” 
(3) (proof) Prove your conjecture. 
Find m,n € Z such that 945m + 219” = GCD(945, 219) using the Tableau 
Method. 


2.3 Linear Equations with Integer Solutions: 


Diophantine Equations 


Overview In the previous section, one of the key ideas we discussed was the 
existence of solutions (x, y) to ax + by = GCD(a,b) where x and y are both 
integers. Suppose we now consider the more general equation in which a, b, and c 
are integers (with a and b not both zero) such that 


ax+by=c (2.3.1) 


and again look for integer solutions. 
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The major result of this section is to give necessary and sufficient conditions for 
Eq. (2.3.1) to have integer solutions. In other words, we will give a condition on the 
integers a, b, and c so that Eq. (2.3.1) has a solution exactly when this condition 
is met (Theorem 9). In the case when there is a solution, we will give all of the 
(infinitely many) solutions. We apply this theory to the “Hundred Fowls” problem 
from sixth century Chinese mathematical literature. To solve the “Hundred Fowls” 
problem without the foundation that we will be establishing would be remarkable 
and it was! Philosophically it is frequently the case that to do problem solving is to 
build a foundation on existing platforms. Here’s its statement—it’s solution is at the 
end of the section. 


Hundred Fowls Problem (Burton [1]): If a cock is worth 5 coins, a hen 3 coins, and three 
chicks together 1 coin, how many cocks, hens and chicks, totaling 100, can be bought for 
100 coins? 


The Hundred Fowls Problem is a special case of Diophantine Equation, mathemat- 
ics problems whose solution involve only integers. We will only consider the linear 
case, the subject is quite deep and the machinery needed in general very abstract 
(and quite beautiful). 


Definition. Let a, b, and c be given integers (not all zero). An equation of the form 
ax+by=c (2.3.2) 


is called a LINEAR DIOPHANTINE EQUATION. A DIOPHANTINE SOLUTION to 
Eq. (2.3.2) is a solution (x, y) in which both x and y are integers. If x, y > 0, then it 
is a SOLUTION IN POSITIVE INTEGERS or a POSITIVE DIOPHANTINE SOLUTION. 


It is customary to apply the term Diophantine equation to any equation in one 
or more unknowns that is to be solved in the integers. We will work with only two 
unknowns. A solution of a Diophantine equation is a pair of integers (xo, yo) that, 
when substituted into the equation, satisfy it; that is, we ask that axp + byo = c 
and that x9 € Z and yo € Z. Thus, the coefficients in a Diophantine equation and 
the solution must all be integers. Of course, there are infinitely many real solutions 
(x, y) to (2.3.1)—our question for this chapter is whether there are solutions in 
which both x and y are integers. 

Although the dates of his birth and death aren’t known accurately, it is commonly 
thought that Diophantus was born in about 200 A.D. and died about 284 (both 
in Alexandria, Egypt). An interesting fact is that there is a crater on the Moon 
named after him! Curiously enough, the linear equation does not appear in the extant 
works of Diophantus (the theory required for its solution can be found in Euclid’s 
Elements), possibly because he viewed it as too easy. Most of his problems dealt 
with finding squares or cubes with certain properties among the integers, rather than 
the linear equations of Eq. (2.3.2). 
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Convention When the phrases “solve the Diophantine equation” or “what are the 
solutions to the Diophantine equation?” are used, it is understood that we are looking 
for Diophantine solutions; that is, we are searching for integer solutions only. 

A given Diophantine equation can have a number of solutions. For example, the 
equation 3x + 6y = 18 has solutions (4, 1), (—6, 6), and (10, —2) since 


3-446-1=18, 
3(-6) + 6-6 = 18, 
3(10) + 6(—2) = 18. 


(It may also have others; it fact, it does.) By contrast, there is no Diophantine 
solution to the equation 2x + 10y = 17, because the left-hand side is an even integer 
for any choice of the integers x and y, whereas the right-hand side is odd. Of course, 
there are many real solutions—it’s just that none of them have the property that 
both x and y are integers. Faced with these possibilities, it is reasonable to enquire 
about the circumstances under which a Diophantine solution is possible and, when 
a solution does exist, whether we can determine all integer solutions explicitly. 

The condition for solvability of the Diophantine equation is easy to obtain, as 
we’ ll now prove. We’ll first show that the linear Diophantine equation ax + by =c 
admits a solution if and only if d|c, where d = GCD(a, b). Here’s the reasoning: if 
d = GCD (a, b), then it must divide both a and b. Thus, we know there are integers 
r and s for which a = dr and b = ds. 

Now let’s assume that there is a Diophantine solution to (2.3.1). If (xo, yo) is a 
Diophantine solution, of ax + by = c, then axg + by) = c. Therefore, from the 
previous paragraph, 


C = axy + byy = drxo + dsyy = d(rx0 + syo) 


which simply says that d|c (since rxp + Syp is an integer). 

Conversely, assume that d|c, say c = dv for some v € Z. We’ll prove that this 
assumption guarantees a solution to (2.3.1). The Euclidean Algorithm Byproduct 
(Theorem 8) shows that there are integers x and y which satisfy d = ax + by. 
When this relation is multiplied by v we see that 


c = dv= (ax + by)v = a(vx) + b(vy). 


Hence the Diophantine equation ax + by = c has x9 = vx and yo = vy as 
a particular solution. This computation proves the first part of our next theorem. 
Note that, in order to solve this problem about solutions of a Diophantine equation, 
we needed to use a lot of the material of the previous section. Now that we know 
the criteria for Diophantine solutions, we prove the fundamental theorem of linear 
Diophantine equations by characterizing the solution. 
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Theorem 9 (Existence and Form of Solutions to Linear Diophantine 
Equations). The linear Diophantine equation ax + by = c has a Diophantine 
solution if and only if c is divisible by d = GCD(a, b). Furthermore, if (Xo, yo) is 
any particular solution of this equation, then all other solutions are given by 


b 
x’ =xo+ (5) t y =yor- (<) t (2.3.3) 


where t is an arbitrary integer. 


Proof. We have already proven the assertion of the first sentence in the preceding 
paragraphs. To establish the second statement of the theorem, let us suppose that 
an integer solution (xo, yo) of the given equation is known. If (x’, y’) is any other 
integer solution, we now show that (x’, y’) takes the form of Eq. (2.3.3). Of course, 


ax + by) =c = ax' + by, 
which is equivalent to 
a(x’ — xo) = b(yo — y’). 


Thanks to Proposition 4 there exist relatively prime integers r and s such that a = 
dr, b = ds. Substituting these values into the previous equation and canceling the 
common factor d, we find that 


r(x’ — x9) = s(vo — J’). (2.3.4) 


Since r is a factor of the left hand side, r must divide the right hand side of (2.3.4) 
also since r|s(yo — y’). Since r and s are relatively prime, r cannot divide s and so it 
must be that case that r divides the other factor (see Proposition 6). Thus r|(yo—y’), 
or in other words, yo — y’ = rt for some integer ¢. Substituting this into Eq. (2.3.4), 
we obtain 


r(x’ — x9) = srt 
or 
/ 
x —Xo = St. 


Plugging the formula for x’ into ax’ + by’ = c, gives y’ which leads us to the 
formulas: 


: (7) 
xX =Xo t+ St=Xot+ Fi L 


ya nonem=(§) 


which are exactly Eq. (2.3.3). 
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It is easy to see that these values are integers which satisfy the Diophantine 
equation, regardless of the choice of the integer t because 


a + ox =alx0+ (5) 1] +8[v0~($) 4] 
= (axo + byo) + (S-S): 


=c+0:-t=c 
" (el 
Note, in particular, that Theorem 9 says that there are either no solutions or 
there is an infinite number of Diophantine solutions to the given equation, one for 
each integer value of ¢ in Eq. (2.3.3). A geometric way to consider this theorem is 
described in Problem 5 at the end of this section. 


Example 45. We will now use the method of Theorem 9 to solve the linear 
Diophantine equation 


16x + 39y = 10. (2.3.5) 


Note that not only do we have to show that Eq. (2.3.5) has a solution, but we actually 
have to find all of them! 


Solution. To see if there is a Diophantine solution to (2.3.5), we must first, 
according to Theorem 9, determine if d = GCD(39, 16) divides 10. A casual 
glance at 16 and 39 show they have no positive common divisors other than one, 
so GCD(16, 39) = 1. However, for practice, we will find d using the methods of 
the last section and then write d as an integral linear combination of 39 and 16. 
Applying the Euclidean Algorithm, we write 


39=2-16+7 
16=2-742 
(2.3.6) 
7=3-241 


2=2-1+[0] 


Therefore, GCD(39, 16) = 1 (16 and 39 are relatively prime) and so there is a 
solution to Eq. (2.3.5) thanks to Theorem 9. 

To solve for a particular solution (which is the next step), we must obtain the 
integer | as a linear combination of a = 16 and b = 39. As in the last section, we 
work up from the next to last equation of (2.3.6). 


1=7-3-2 
= 7—3(16—2-7) =7—3(a—2-7) 
= 7— 3a + 3(2-7) =7-7—3a. 
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Since the top equation is 7 = 39 — 2- 16 = b — 2a, the above simplifies to 
1 = 7(b — 2a) — 3a = 7b — 17a (2.3.7) 
Multiplying Eq. (2.3.7) by 10 
70b — 170a = 10 
16(—170) + 39(70) = 10. 


Thus, a particular solution of Eq. (2.3.5) is x» = —170 and yo = 70. All solutions 
are therefore expressed by 


x = —170+ 39t and y = 70— 16t (2.3.8) 


for some integer f. // 


There is another way to find a particular solution using an elementary method 
as follows. We solve the Diophantine equation 16x + 39y = 10 for x to get the 
equation x = — Thus 


10—7y 
16 


x= —2y + (2.3.9) 


and we are left with the following question. When is ath an integer? We must look 
for cases when the numerator is a multiple of 16. For example, in the case —480, 
10 — 7y9 = —480 yields yop = 70 and x9 = —170. Another solution can be found 
such as Xo = —14 and yo = 6. 

The above technique can also be used for general solutions. It can best be 
explained by an example as the following: 


5x + 22y = 18 
implies 


18 —22 a=3 
x= = 3-4y + s 


Since — € Z, we can write z = —> 


same argument, solving y for z, namely: 


= z. So 2y + 5z = 3. We now repeat the 


3—5z l-z 
oy aioe aa eee 
2 lear 


Let 5 =t,thenz = 1—2t,y = (l—2z)+t = 1—2(1—2t) +t = —-1+ St and 
18—22y __ 18—22(—1+5t) _ 8 —22¢ 
5 5 , 
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A third way to find a particular solution is to guess! For example, is the 
Diophantine problem 7x + 4y = 100 (which will appear in the solution to the 
Hundred Fowls Problem as Eq. (2.3.11)), the easy guess of x = O and y = 25 
jumps out at us. 


Corollary 4. [fa and b are relatively prime and if (Xo, yo) is a particular solution 
to the linear Diophantine equation ax + by = c, then all solutions are given by the 
pair (x, y) with 


xX = Xo + bt and y = yo — at 


for integral values of t. 


Here is an easy illustration. The equation 5x + 22y = 18 has (by guessing) 
Xo = 8, yo = —1 as a solution. Since 5 and 22 are relatively prime, Corollary 4 
gives a complete solution, x = 8 + 22t¢ and y = —1 — 5t for arbitrary integer, r. 


Definition. The solution of a Diophantine equation is a POSITIVE SOLUTION if 
each of its variables are positive. 


Example 46. What are the positive Diophantine solutions of 
172x + 20y = 1000. (2.3.10) 


Solution. In Problem 4 at the end of this section you will show that all solutions of 
Eq. (2.3.10) are of the form 


x = 500 + 5t and — 4250 — 43¢ fort € Z. 


To find solution with x > 0 and y > 0, f must be chosen to simultaneously satisfy 
these inequalities. In other words, it must be that both 


5t + 500 > 0 and — 43t — 4250 > 0. 


Therefore, or, what amounts to the same thing, 


4250 36 
——— = —98— >t > —100 
43 
Because ¢ must be an integer, we are forced to conclude that tf = —99. Thus, our 
Diophantine equation has a unique positive solution, x = 5, y = 7, corresponding 
tot = —99. Hf 


Linear indeterminate problems such as those of Diophantus have a long history, 
occurring as early as the first century in the Chinese mathematical literature. Owing 
to a lack of algebraic symbolism, they often appeared in the guise of rhetorical 
puzzles or riddles. The contents of the Mathematical Classic of Chang Chin-chien 
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(sixth century) attest to the algebraic abilities of Chinese scholars. This elaborate 
treatise contains one of the most famous (in the sense of transmission to other 
societies) problems in indeterminate equations, the problem of the “hundred fowls”. 
The problem quoted at the beginning of this section asks: 

Hundred Fowls Problem: (Burton [1]) If a cock is worth 5 coins, a hen 3 coins, and three 


chicks together 1 coin, how many cocks, hens and chicks, totaling 100, can be bought for 
100 coins? 


Thanks to the machinery that we have derived in the last and this section, the 
Hundred Fowls problem is straightforward to solve. Before you read further, think 
about how you would solve (regardless of whether you live in the twenty-first 
or sixth century) the Hundred Fowls problem before you had the foundation of 
Sects. 2.2 and 2.3. 

If x equals the number of cocks, y the number of hens, and z the number of 
chicks then the Hundred Fowls Problem becomes 


1 
ete ae a 100 and x + y+ z= 100. 


Eliminating one of the unknowns, we are left with a linear Diophantine equation in 
the other two unknowns. Specifically, because the second equation is z = 100—x — 
y, we have 5x + 3y + (100 — x —y) = 100. The Hundred Fowls Problem thus 
reduces to solving the linear Diophantine equation 


7x + 4y = 100. (2.3.11) 


We solve the Hundred Fowls problem using the results of Corollary 4, remem- 
bering that in this case x > 0, y > 0 and z => 0 (since you cannot have a negative 
number of fowls!). We are looking then for a solution in the non-negative integers 
only. 

To carry out our procedure, we need a particular solution to Eq. (2.3.11). Now 
there is one obvious solution, (0, 25). Corollary 4 shows this equation has the of the 
form general solution x = 4t and y = 25 — 7t wheret € Z. Becausex + y+z= 
100, z = 75 + 3t. Chang himself gave several answers: 


and 


and 
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A little more effort produces all solutions in the positive integers. To find solutions 
in the positive integers for x and y, ¢ must be chosen to satisfy simultaneously the 
inequalities 


4t > 0, 25 —7t > 0, and also 75 + 3t > 0. 


. Since 4t > 0, 


The last two of these inequalities are equivalent to —25 < ft < 34 
= 0,1,2,3, leading 


t must have a nonnegative integer value, we conclude that ¢ 
precisely to the values that Chang obtained. 


Problem Set 2.3: 


1. Which of the following Diophantine equations cannot be solved (that is, have 
no integer solutions)? 


(a) 3x + 15y = 21 
(b) 30x + 25y = 101 
(c) 2x + 1l4y = 91 


2. Determine all solutions in the integers, and positive integers, to the following 
Diophantine equations: 


(a) 56x + 12y = 60 
(b) 12x + 18y = 36 
(c) 101x + 1002y = 1 
(d) 5x +22y = 18 


3. Use the method of Theorem 9 to solve the linear Diophantine equation 172x + 
20y = 1000. 

4. Prove Corollary 4 by using Theorem 9. 

5. Here’s a geometric visualization of Theorem 9 which gives all of the solutions 
to a Diophantine equation. If (2.3.1) has a particular solution, (xo, yo), prove 
that the set of all solutions is the set of all points on the line through (x9, yo) 
with slope —| both of whose coordinates are integers. 

6. Prove that the Diophantine equation ax + by + cz = d is solvable if and only 
if GCD(a, b, c) divides d. 

7. (a) Is 15x + 12y + 30z = 34 solvable in integers? 

(b) If 15x + 12y + 30z = k is solvable in integers, then what are possible 
values of k? 


2.4 A Brief Introduction to Combinatorics 


Overview In previous sections, we have explored some techniques for solving 
certain linear Diophantine equations. We now turn our attention from looking for 
solutions to that of counting how many solutions there are to a given equation. This 
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will require first an introduction to the notions of permutations and combinations 
followed by their application to Diophantine equations. We do not assume that the 
reader has a thorough background in the subject, usually called combinatorics, 
although you will have seen the beginnings of combinatorics in high school, in a 
“math for future teachers” course, or more advanced courses which are dedicated 
to it. 

We then will give some examples but not examine this vast and interesting area 
as we are only interested in exploring the material for use in number theory. We start 
with two simple examples. 


Example 47. If Richard has 3 sweaters and 4 shirts, how many ways can he pick a 
pair consisting of a sweater and a shirt? 


Solution. If we list the shirts (sh) and sweaters (sw) as in the table below, then each 
pick of Richard’s can be viewed as an arrow joining one of the shirts to one of the 
sweaters. In the case indicated, it connects shirt 2 with sweater 2. 


shirts sweaters 
shl swl 
sh2 ————+ sw2 
sh3 sw3 
sh4 


Since we can connect a shirt with any of 3 sweaters, it seems clear that the total 
number of shirt/sweater pairs is 3 + 3 +3 + 3 = 3(4) = 12. The answer is just the 
product of the number of shirts and the number of sweaters. // 


It is important to note that picking a shirt does not influence at all which sweater 
is chosen. In such cases, we say that the choices are INDEPENDENT. We will be 
concerned nearly exclusively with counting independent events, but to illustrate any 
concept (such as independence), it is useful to give an example in which the concept 
doesn’t apply. 


Example 48. In a deck of playing cards there are 4 kings and 48 other cards. Jane 
is going to pick twice. How many ways are there 


(A) of getting two kings if Jane replaces the first pick, shuffles the deck and then 
picks again? 

(B) of getting exactly one king if Jane holds the first pick and from the remaining 
51 cards picks again? 


Solution. 


(A) Jane has 4 choices of a king the first time and 4 the second time so the answer is 
16. (The events of picking are independent—she’s doing the same thing twice). 
(B) In this case, we can’t just multiply because what happens on her first pick 
changes the number of choices. There are 4 choices of a king the first time she 
picks. We now must divide the possibilities into two cases. The first is that she 
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is successful on the first pick (she gets one of the 4 kings). She must then pick 
something other than a king and there are 48 ways to do that. Thus 


Number of ways of picking a king followed by another card which is not a 
king = 4(48) = 192. 


On the other hand, if she gets a non-king first (48 possibilities) and then a 
king (4 possibilities), there are (48)4 = 192 ways. Thus there are a total of 
192 + 192 = 384 choices. / 


What we see in Case B is that we must be careful as to whether the events are 
independent in Jane’s experiment. To be more precise, the event of picking a king 
and then the second event of another king from the original deck are independent 
(Part A). The event of picking only one king involves dependent events. This is the 
difference between picking with replacement versus without replacement. Note that 
the answer to Part B is NOT 4(48), which would be the answer to “first pick a king 
and then pick a non-king without replacement of the first card”. 

Deciding whether two events are independent can be tricky. Culture and taste 
come into the equation, as we can see in the next example. 


Example 49. A restaurant serves chicken, steak, tofu, beef, mashed potatoes, 
French fries, baked potatoes and green beans. How many different ways are there to 
select two items for your meal. 


Proof (Solution for Discussion). Think about and share ideas with friends for what 
a solution even means! Would you want 2 orders of mashed potatoes as your 
entire meal? (In the language of Example 48, this is the question of picking “with 
replacement” or “without replacement’.) 

Here are other “practical questions”: Would you want an order of chicken and an 
order of steak? Do you eat tofu? Are you a vegetarian? 

There is no correct answer to this problem until the exact criteria for the meal are 
specified. oi 


It should be apparent the notation for these types of problems could become very 
cumbersome if we must make many choices. To motivate our notation, suppose 
you have 4 red balls, 3 white ones, 2 green ones and a yellow one. Then there are 
4.3-2-1 = 24 ways of picking a red, white, green and yellow ball in that order. This 
kind of computation has a very specific formula which leads us to make a definition 
which helps with writing formulas succinctly. 


Definition. If 1 is a positive integer, » FACTORIAL, written as n!, is 
nl! =n(n—1)---3-2-1. 


We also define 0! as 1. Note that 3! = 3-2-1 = 6and 4! = 4-3-2-1 = 24. In fact, 
41= 4.3), 


Exercise 32. Prove that for an integer at least 1 


n!=n-(n—-1)! 
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Example 50. As in Example 48, Jane again wants to pick two cards but this time 
she doesn’t replace the first card into the deck. Assuming the order of kings matters 
(so getting the king of hearts and then getting the king of spades is different than 
getting the king of spades and then the king of hearts), how many ways are there of 
getting 2 kings? 


Solution. She can find 4 opportunities for the first king and 3 for the second. Thus 
there are 12 = 4-3 ways. // 


Note that the answer to Example 50 can be written as 


4! n! 


4-3 = — = ——_ 
2! (n—-r)! 


, Where = 4, andr = 2. 


This kind of problem (how many ways are there of picking r objects in order 
without replacement from a total of m objects?) comes up so often in problem 
solving that its solution has a special name, permutation, and symbol ,P,. 


Definition. Let n and r be non-negative integers, with n > r. A PERMUTATION of 
n objects taken r at a time is 


n!} 
eT naar 
Suppose now that we consider a different problem, one in which the objects we 
choose are not all different. 


Example 51. A n-CODE WORD is a string of 7 letters. (It need not make sense in 
any language). How many 5-code words can be made from the letters of the set 
{T, U,V, W, X} if 


(A) the 5 letters are distinct? 
(B) 3 of the letters are W and the others are T and U? 


Solution. 


(A) Since the letters are all different, there are 5 choices for the first, 4 for the 
second, etc. Thus the answer is 


5-4-3-2-1=5! 


which is 120. 
(B) To solve this part of the problem, let’s first make the W’s “look different” by 
adding subscripts to them. The possibilities are then 


T, U, W,, W2, W3. 


Since they are all different now, we know from Part A that the answer is 5!. 
However, since the W’s are all the same the following three different words 
are actually the same UW2TW3W,, UW, T W2W3, and UW2TW, W3. Each of 
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them is the code word UWTWW of the original problem. In fact, if U is in the 
first place and T is in the third then there are 6 code words which appear in 
Part A but would be identical in Part B. Thus the answer to Part B is 2 or x, 
since 6 = 3!. // 


Note that to avoid the cultural aspects of what a “word” is, we defined it carefully 
in the problem. Compare the statement of Example 51 with that of Example 49. 


Exercise 33. How many 5-code words from Part A of the previous example become 
UWTWW in Part B? 


From this discussion, we see that if there are n objects such that k of them are 
identical and the other (” — k) distinct and differ from the first k objects, then the 
number of code words is 7!/ (n — k)! Which is exactly the number of permutations 
of n things taken k at a time, ,,P,. 

Let’s move on to a more complex situation in which the same logic holds and 
introduce the idea and notation of “combinations,” in contrast to permutations. The 
key difference between the two is whether the order of the groupings matters. 

Suppose we have n different people and we wish to pick r of them (r < n) 
for a committee. This “committee problem” gives a good insight into the difference 
between a combination and a permutation. In a permutation, the order of choice 
matters (think about picking a president, a vice president, and treasurer of an 
organization). For a committee of three, it doesn’t matter in which order you pick 
the committee members—it’s still the same committee. 

Since the people of our committee are different, it might seem like the answer 
should be n(n — 1)---(n — r + 1). However, this is a committee so the order 
of choosing the people doesn’t matter. While people are different, what matters 
is whether someone is chosen for the committee or not. Two people are then 
“identical” for the problem we are considering if they are both on the committee 
or both not on the committee. Thus if we count permutation, we have counted the 
same committee r! times. (This is just like the code word problem where we counted 
the same code word 3! times.) The answer to the committee problem is obtained by 
dividing by the number of times we have counted the same committee. 


number of committees ofr members _ n(n — 1)---(n—r + 1) (2.4.1) 
of a population of n 7 r! 


This expression comes up so often in counting that it has its own name and symbol, 
nCy or ("). You should read ,,C, or (") as either “n choose r” or “the number of 
combinations of n things taken r at a time”. ,C, represents the number of ways of 
picking r objects (from n) without regard to order. 


Definition. If in > 0 andr > 0 are integers with n > r, then the COMBINATIONS 
OF n THINGS TAKEN Pr AT A TIME is 


ce Gal een 2.4.2 
n come ~ rin —r) (2.4.2) 
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Note that ("”) is exactly the expression in Eq. (2.4.1) (and we will use (") or ,C, 
interchangeably). 


Note (") is also the number of subsets of an n-element set that contains r elements. 


Example 52. By (2.4.2), we see that the number of subsets containing 3 elements 
that can be formed from the set {a, b,c, d, e} is (3) a = = ae = x = 10. 
The ten subsets in question are {a,b,c}, {a,b,d}, {a,b,e}, {a,c,d}, 


{a,c,e}, {a,d,e}, {b,c, d}, {b,c,e}, {b,d,e}, {c,d,e} 


Example 53. How many ways can a committee of four women be chosen from six 
women? 


Solution. It is equivalent to find the number of subsets containing 4 ele- 
ments that can be formed from a 6 element set. Therefore, the answer is 
()= 8 =8=15 M 
4) ~ 4a — 2 ~ *Ps 


Problem Set 2.4: 


1. Find the value of each of the following. 


100 15 11 
b 
(a) (‘°) ( (2) (c) € 
2. If Jeff has 3 sweaters, 5 shirts and 3 pants, then how many ways can Jeff pick 


an outfit (one sweater, one shirt, and one pair of pants) if 


(a) there is no restriction on his choice. 

(b) there is one red sweater, one red shirt, and one red pair of pants and all the 
other cloths are different colors (and not red) and Jeff won’t wear an outfit 
with two red items? 

(c) If there are 3 red shirts in Part B instead of one, answer the same question. 


3. If Maria is playing with a standard deck of cards, how many ways can she 


(a) pick two cards with replacement each of which is less than 6? (An ace 
should be counted as low in this exercise.) 
(b) pick two cards without replacement each of which is less than 6? 


4. (a) How many 6-code words can be made from letters in the English alphabet? 
(Do not do all of the arithmetic to get a specific number; a formula is fine.) 
(b) Answer Part (a) if the first choice must be a vowel (the letter y is not a 
vowel for this exercise.) 
(c) Answer Part (a) if there must be two vowels in each of the 6-code words. 
5. Are the next equations true for positive integers n and m? In other words, either 
prove or give a counterexample to each of the following assertions: 


(a) (n?)!=n!n! 


(b) (n+m)!=n!4+m! 
(c) 2) = n(n—1)---(n—m) ifn >m 
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6. Show that (2.4.2) and (2.4.1) are the same expression. 

. Prove that ("") = (,"_). 

8. Prove that ("") = Cs) + () by using (2.4.2). Can you prove it without doing 
anything formal? (see Problem 7) This identity was proven by B. Pascal (1623- 
1662). Therefore it is called the Pascal’s identity. 

9. A student must answer six of the ten questions on a final exam. How many 
different ways can he complete the exam. 

10. (2) = 2,598960. If (77) = 2,598960 and k # 5, then k is equal to what 
integer? 

11. A committee of three—Chair, Secretary, and Treasurer—is to be elected by a 
club with 14 members. If every member is eligible to stand for each position, 
how many different committees are possible. 


12. What is the sum of (§) + (*) + (3) pee tf (3)? 


~ 


It’s interesting to see that Problem 7 has to be true without doing anything formal 
(although you should). To form a committee of r people from n, we simultaneously 
form a committee of (7m — 1) also (the ones who were not chosen). Thus the number 
of ways of choosing r objects from n should be the same as the number of ways of 
choosing (n —r) fromn. Problem 7 is a formal statement of this heuristic argument. 


2.5 Linear Diophantine Equations and Counting 


We now turn our attention on counting how many solutions there are to a given 
linear Diophantine equation with unit coefficients. We begin with a simple example. 


Example 54. How many positive integer solutions are there for the equation x + 
y+z=6? 


Solution. The number of solutions is so small that they may be listed easily. In fact, 
the possibilities for (x, y, z) are: 
(4,1, 1), (3,2,1), (3,1, 2), (2,3, 1), (2,2,2), (2,1,3), 0.4, 1, (1,3, 2), 
(1, 2,3), (1,4) // 
There is an easy method for finding the number of positive solutions to such 
equations, suggested by the following equation: 


14+14+14+1+1+1=6. 


To select a solution (x, y, z), we need only chose two of the plus signs to break the 
sum up into three parts. For instance, 1 6 1+1+161+1 = 6 corresponds 
tol +3+2 = 6 or the solution (1, 3,2). This can be done in (:) = 10 ways. 
Generalizing this example we have the following theorem. 
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Theorem 10. The equation x, +.x2+%3+++:+X, = m has exactly C_ POsSitive 
integer solutions. 


Proof. Each choice of n — 1 plus signs from the m — | plus signs in the sum 
corresponds to a positive integer solution. fo 


Example 55. How many positive integer solutions are there for the equation x; + 
Xo + x3+ 2x4 = 16? 


Solution. Here, m = 16 andn = 4, so there are (3) = 455 positive integer 


solutions. // 


Example 56. In an undergraduate dormitory there are several freshmen, sopho- 
mores, juniors, and seniors. In how many ways can a team of 12 students be chosen 
such that the team contains at least one student of each year? 


Solution. Let x,,x2,x3, and x4 be the number of students that are freshmen, 
sophomores, juniors, and seniors in a team respectively. Then x; +x2+x3+X4 = 12. 
xX, > 0, X2 > 0, x3 > 0, x4 > 0. Thus by Theorem 10 with m = 12 andn = 4, we 


obtain 
12-1 11 11! 11-10-9 
a a a = 165 
4-1 3 318! 3-2-1 
// 
To find the number of solutions in non-negative integers to our equation x; + 
X2 +++++X, =m, we have the following theorem. 


m+n—1 
n—-1 


Theorem 11. The equation x; + X2 +++: +X, = m has exactly ( 
negative integer solutions. 


) non- 


Proof. Let yy = x1 + 1, yo = x2 + 1, 93 = x3 + 1,°++, Wy = X + 1. Thus, 
xX) = yj —1,X2 = yo -1,°++, X, = yy, — 1. Substituting into our equation gives 


Oi De Dee Ore Dam. Le. Yip yar ie = eR 


which has Ca) positive integer solutions. Therefore, x; + x2 +--- +x, =m 
m+n—-1 


si ) non-negative integer solutions. @ 


has exactly ( 


Example 57. A bakery makes 5 different types of donuts. How many different 
assortments of one dozen donuts can be purchased? 


Solution. Let x; (@ = 1,2,3,4,5) denote the 5 different types of donuts. The 
number of different assortments of one dozen donuts can be rephrased as the number 
of possible non-negative integer solutions to the following equation. 


Xp tx. +--+ x5 = 12, x; > 0. 


Which, by Theorem 11, is ('7*°7') = 1,820. iI 
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Example 58. Nine professors are eating lunch together at a restaurant that offers 
four different desserts. How many different selections of desserts can be ordered by 
the group? 


Solution. Let x; (i = 1,2,3,4) denote the different deserts offered by the 
restaurant. Thus, the number of selections of desserts that can be ordered is the 
number of non-negative integer solutions to the following equation. 


Xp + xX. +43 4+ x4 = 9. 


Thus, by Theorem 11, the answer is (| = 220. // 
Note If we wanted to allow the possibility that some professors might order no 
dessert, then there are 5 types of individual dessert orders. 


Finally, suppose we are interested in restricted solutions to an equation. 


Example 59. How many integer solutions are there to the equationx + y+z+tf= 
19 with x > 0, y > 7,z,t => 0? 


Solution. Let uj) = x, uw = y —6,u3 = z+ 1 and u4 = t + 1. Then the original 
equation is equivalent to 


utigt+6+u3-—1l+u,—1= 19, Le., uy tu + 3 + U4 = 15. 


Thus, by Theorem 10, we have Ca) = 364 solutions. // 
Note We could have used Theorem 11 by setting uw; = x — 1, uw = y —7, u3 = z, 
and u4 = ¢. 


Example 60. In how many ways can a team of 12 be chosen such that it has at least 
one freshman, at least one sophomore, at least two juniors, and at least three seniors? 


Solution. Let x1, x2, x3, x4 be the number of students from each year. In this case, 
we have 


Xp+xX2+%3+%4= 12, KF 1, 22 21,543 > 2, x4 > 3. 


Let uy = X1, Uy = X2, U3 = X3— 1, and ug = x4 — 2. Now, 
wWtwtwyt+lt+uy4+2=12 Sutmwmtuw+u4=9 Wy, U0, U3,U4 > 1 


By Theorem 10, we have n = 4 and m = 9 and the answer is i = 56. // 


Problem Set 2.5: 


Find the number of integer solutions to the following equations satisfying the 
given restraints. 
l.xtyt+z=8 x21, y20,z20. 
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~xX+tytztu=15,x>1ly>1,z>0,u=2. 
~xXx+y+z=10.x>0,y >2,z> 1. 
. In how many ways can 20 pieces of fruit be selected from a basket of bananas, 


cherries, and apples with at least 1 piece of each type of fruit being selected? 
How many different baskets of fruit can be prepared? 


. The University of Nevada, Las Vegas represents a large group of freshmen, 


sophomores, juniors, seniors, and graduate students. Find the number of ways 
of forming a team of 20 UNLV students such that 


(a) at least two members of the team are seniors and there is at least one member 
from each of the other groups. 

(b) atleast two of the team are seniors and the team contains atleast one graduate 
student. 


. A group of nine children have set out to find 72 tennis balls around a group of 


trees next to John’s home to put in their own box. In how many ways can the 
balls be distributed in the boxes? 


. A collection has a large set of roses which are red, yellow, pink, and orange. How 


many different combinations (counting color) can be obtained by choosing ten 
roses? 


. A restaurant offers five different juice drinks at a price of $2 each. How many 


selections of juice can a customer with $20 order? 


. What is the number of non-negative integer solutions of x + y+z+u < 14? 


(Hint: x + y+z+u< 14ifandonlyifx+y+z+u+t= 1l4witht > 0.) 


Chapter 3 
Equivalence Classes with Applications 
to Clock Arithmetic and Fractions 


Introduction We will first define a quite unifying concept: what it means for 
two objects to be equivalent (rather than equal), and apply it in the next section 
to modular (or clock) arithmetic and in the following one to establish a rigorous 
definition of fractions. The notion of equivalence is pervasive in mathematics and 
will be a part of many courses you will take in the future, including abstract algebra 
and geometry. The last section shows that there are fascinating applications from 
number theory. 


3.1 Equivalence Relations and Equivalence Classes 


Overview While modular arithmetic (the arithmetic of a clock is an example) is 
quite intuitive, fractions are not. The arithmetic of fractions is generally difficult 
for children to learn, hard to truly understand conceptually, extremely difficult to 
define carefully, and applicable. Modular arithmetic is interesting and widespread. 
Mathematics requires clear, concise definitions. Before we can ask whether 7, 1, or 
5 are prime numbers, for example, we must have a precise definition of the concept 
“prime number’. Why is | not a prime? Because the very definition of a prime 
number excludes 1 being a prime number (and all mathematicians have adopted 
that definition). Occasionally there are definitions which differ in various books, but 
that occurs rarely. Because definitions are so important in mathematics, going back 
to the definition of a concept may be necessary in order to truly understand what a 
question is asking or why a proof is constructed the way that it is. Thus “looking at 
definitions” can be thought of as a problem solving strategy. 

Some concepts are easily defined (e.g. a square is a rectangle with all sides 
the same length) and others are more difficult. The next two concepts of the text 
(modular arithmetic and fractions) both require the notion of an equivalence relation 
on a set. 
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What is a succinct way of differentiating between equality and equivalence? 
Consider a room full of people. If we are interested in determining a fine basketball 
team of five of these people then the quality of the basketball skills matters greatly — 
that is, each person will add something different to the team. No person is “equal” 
to another in any reasonable sense. On the other hand, if we are interested in 
performing interviews which requires siblings then we don’t really care about 
individuals, only whether two people are related as brothers, sisters, or brother/sister. 
While the brothers Philip and Steven are different, they are the same for our 
experiment. We would say then that Philip and Steven are “equivalent under the 
sibling relationship” but not equal. 

We will now formally define the products of sets, relations on a set and 
equivalence relations. These notions are the heart of much of mathematics and 
certainly are the backbone of the next two sections. You will use them often in 
future courses. 


Definition. If A and B are sets, the CARTESIAN or CROSS PRODUCT of A and B is 
Ax B= {(a,b)|ae¢Aandbe B}. 


A x B consists of pairs of elements of A and B (in that order). 


Example 61. R? is defined in most text as R? = {(x, y) | x € R, y € R} and so 
R* = R x Ris the plane, as usual. 


Informally, a relation on a set A is a way of deciding whether, given a), a2 € A, 
a is related (as in the sibling example above) to a2. Intuitively, a relation on a set is 
something where we can determine whether or not two elements are related (true) 
or not (false). 


Definition. Let A be a set. R is a RELATION on A if R is a function from A x A 
to the set of two elements {T, F'}. If R(a,,a2) = T, we say that a; and az are 
RELATED BY R. If R(a,, a2) = F,, then a; and az are NOT RELATED BY R. 


When you read the next paragraph and the examples, keep in your mind the 
“sibling relation” for the set of people A, JT = true and F = false. In that context 
R(a\, a2) = T means that person a; and person dp are “in truth” brothers or sisters 
whereas R(a),a2) = F means that person a, and a are not siblings. 

Going back to the general case of R being a relation of a set A, we are thinking 
that, if aj,ad. € A then R(a;,a.) = T (for true) if a; is related to az and 
R(a\, a2) = F (for false) if a, is not related to a). The key here is that the defining 
property of R is either true or false but not both. Since we’ll usually be dealing with 
relations defined by equations, it will be easy to decide whether or not the numbers 
are related. It is usually easier to write a; Raz to mean that a, is related by R to az 
or R(a), a2) = T. We'll use both notations as is customary in most books. 


Definition. Let A = R! and a,,a) € A. Then we define the ABSOLUTE VALUE 
RELATION to be a; Raz = T exactly when |a;| = |a2| (so a;Ray = F when 
|a1| A |a@2|). Thus a; and a are related under the absolute value relation if and only 
if |ai| = |ap|. 
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Definition. Let A = R*. If a; = (x1, y1) and az = (x2, y2) then we define the 
NORM RELATION as a;Ray = T if x7 + y? = x} + yz and R(a),a2) = F if 
xi typ Ax t+ y}. 

In the first definition above, 1 is absolute value related to —1 and to itself, but to 


no other real number. In the second definition above, note that (5 +) is norm 


J? 
related to (-4. 3) since the sum of the squares of their coordinates is 1 in either 
case. If fact, (x1, yi) and (x2, y2) are norm related exactly when they both lie on the 
same circle about the origin of the same radius provided that neither is (0,0). The 
point (0, 0) has no other element of R? “norm related” to it, whereas all other points 
do have (many) other points which are related to them. 

There will be two relations that will be especially important to us: relating 
integers (which we will define here and use in modular arithmetic in Sect. 3.2), and 
relating fractions in Sect. 3.3. 

Suppose that we are interested in a relation based on whether two integers a; and 
az give the same remainder on division by 3. This would mean that 2 is related to 
5, 11, and 20 (among others) but 2 is not related to 4. Can we give the relation in 
other terms? Yes; to say that a; and az have the same remainder on division by 3 
is to say that a, — az is divisible by 3. For example, in this relation 2 is related to 
20 because both have remainder 2 on division by 3. We are thus led to the formal 
definition of relation modulo 3 on the set of integers; a; Raz if there is an integer k 
such that a; — a2 = 3k. More generally, 


Definition. Let 7 be a positive integer (n > 1). We say that a, and az, both integers, 
are CONGRUENT MOD n (or just simply CONGRUENT) if n|(a, —d2) or equivalently, 
there is an integer k such that 


a, —da,=kn. (3.1.1) 


We write a) = a2 (mod n) if a; and a2 are congruent modulo n and may say that 
a, is congruent mod n to a2 as a shorthand. 


Example 62. Show each of the following. 


A. 5= 17 (mod 6). 
B. —1 ¥ 23 (mod 21). 
C. —15 = —3 (mod 6). 


Solution. Each of these amounts to using the definition which is contained in 
Eq. (3.1.1). That is, given a), a2 and n, is there ak € Z which solves (3.1.1)? 


A. 5-17 = 12 = 2(6) sok = 2 works in Eq. (3.1.1), and, yes, 5 = 17 (mod 6) 

B. —1 — 23 = —24 but there is no integer, k, with —24 = 21k, so -—1 # 23 
(mod 21). 

C. —15 — (—3) = —12 = (—2)6, so —15 = —3 (mod 16). // 
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Example 63. 


A. Show that two integers are congruent modulo 2 if and only if both are even or 
both are odd. 
B. Show that every integer is congruent to 0,1 or 2 modulo 3. 


Solution. 


A. Suppose that a; and az are given and both are even. (The case where both a, 
and a2 are odd is Problem | of Problem Set 3.1.) Then there are two integers k, 
and k» such that 


a, = 2k, and a2 = 2k. 


Thus a; — az = 2(k, — kz). Because we may let k = k, — k2 in Eq. (3.1.1) we 
have that a; = a2 (mod 2). 

Now suppose da, is even and a2 is odd. We must show that a; and a> are not 
congruent modulo 2. Once again, there are kj,ky € Z so that a; = 2k, and 
ay = 2k» + 1 so that 


a, —a2 = 2(ky —ky)- 1. 


Thus a; — a is not divisible by 2 (since —1 isn’t). 

B. Leta € Z. We will show that a is congruent to 0, 1, or 2 modulo 3 by showing 
that a = r (mod 3) is the remainder of a on division by 3. The Division 
Algorithm (Eq. (*) of Sect. 1.1) shows that there are integers q and r with 
0 <r <3 such that 


a=q3+r 


ora —r = 3q. Since 3q is divisible by 3, so is a — r and so a~3r. Because 
0<r<3,r=0,1l,or2. // 


The example of relations modulo n is important in number theory and gives the 
basis for clock arithmetic that is covered in elementary school. To say that a; is 
congruent to a2 modulo n means that a, and az differ by a multiple of n. This is the 
same as saying that a; and az have the same remainder on division by n. For the 
clock interpretation, we’ll use n = 12. Informally, we identify (when not in Europe 
or on Military time) 13 o’clock with 1, 18 o’clock with 6 and 0 o’clock (midnight) 
with 12 (noon). 

Here’s a problem in clock arithmetic. What is 3h after 10 o’clock (on a 12h 
clock)? It is 1 o’clock, of course, but think what 10 + 3 is. 10 + 3 = 13 is not 
equal to 1, but it is related to 1. This easy computation hints that there is a similarity 
between numbers that are equivalent modulo 12 and the arithmetic of clocks as we’ Il 
see in Proposition 8 . 

We shall now define certain properties of relations which are appealing (some 
properties require that an element be related to itself, others are symmetric, etc.). 
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Definition. Let R be a relation on a set, A. 


R is a REFLEXIVE RELATION if aRa for alla € A. 

R is a SYMMETRIC RELATION if aRb implies that bRa for alla,b € A. 

R is a TRANSITIVE RELATION if aRb and bRc means that aRc for all a,b, 
andc € A. 


Definition. Let P = (xo, yo) be an element of R?. The NORM of P is 


|P| = xo + x. 


Example 64. If A is the set of all women in your town, and R is the relation “is the 
sister of,’ is R a symmetric, reflexive, or transitive relation on A? 


Solution. R is certainly symmetric, but most would say that Sandy is not the 
sister of Sandy so R is not reflexive. Whether R is transitive depends on your 
definition of “sister”. If it includes half-sister then it is possible for (Sandy) R(Cathy) 
and (Cathy) R(Judy) but for Sandy to not be related to Judy. (Why?) Furthermore, 
(Cathy) R(Sandy) and (Sandy) R(Cathy) but is it true that (Cathy) R(Cathy)? // 


Exercise 34. Show that the absolute value and norm relations are symmetric, 
reflexive and transitive. 


Definition. We define the GREATER THAN OR EQUAL RELATION on Z to be aRb 
ifa>b. 


Exercise 35. Prove or find a counterexample to each of the following statements. 


A. The Greater than or equal to relation is symmetric. 
B. The Greater than or equal to relation is reflexive. 
C. The Greater than or equal to relation is transitive. 


Definition. A relation R is an EQUIVALENCE RELATION if R is reflexive, symmet- 
ric, and transitive. If R is an equivalence relation, we say that a is EQUIVALENT 
(UNDER R) to b if aRb or, more frequently, an equivalence relation is written ~ or 
aw~b. 


An easy example of an equivalence relation on the set of all finite sets is to say 
that S, RS> if S; and Sj have the same number of elements. While it is easy for us 
as adults, that relation is quite abstract for young children. Watching a 3, 4, 5, or 6 
year old struggle with why three bananas are “equal” to three apples is fascinating. 
In our language, the sets are not equal, they are just equivalent under the relation 
“same number of elements’. As adults, we are so used to this abstraction that we 
don’t even think about it. On the other hand, for people new to the idea of sets 
and the number of elements in them (e.g. children), it is difficult. While we are 
most certainly not advocating teaching set theory and equivalence relations to young 
children, our point is that the concepts which, when first learned, were abstract and 
difficult become automatic with practice and understanding. 


Exercise 36. Is the “absolute value relation” from the definition at the beginning of 
this section an equivalence relation? Why? 
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Proposition 7. For any integer n > 0, congruent modulo n (=,) on Z is an 
equivalence relation. 


Proof. Letn > 0. To show that congruent modulo n(=, ) is an equivalence relation, 
we must prove the =,, is reflexive, symmetric, and transitive. 


Reflexive: If a € Z, we need to show that a =, a. Looking at Eq. (3.1.2), that is 
the same as showing that there is ak € Z such thata —a = kn. Setting k = 0 
shows that there is such a k and soa =, a for anya € Z. 

Symmetric: Let a,b € Z. Assume that a =, b. We will show that b =, a. Ifa =, 
b, then there is an integer k with a — b = kn (3.1.1) so that (b — a) = (—k)n. 
Because (—k) is also an integer, b =, a and so =, is symmetric. 

Transitive: Let a,b, andc € Z. To show that congruent modulo n is transitive, we 
assume that a =, b and b =, c and show that a and c are related modulo n. 
We know from the definition of =, that there are integers k, and k, with 


a—b=knandb—c=kon. 


Adding these two equations gives a—c = (k, + k2)n. Since kj + ko € Z, 
a =, c and =, is transitive. || 


Language. Because congruent modulo n is actually an equivalence relation, we 
will say that “a is equivalent to b modulo n” or “a is equivalent to b mod n” to 
mean the same thing as “a is congruent to b modulo n” and use the same symbol 
a =, b for both. 


The usual usage in mathematics is “a is equivalent to b modulo n” but we wanted 
to point out that if a relation has the name “equivalence” in its title that doesn’t mean 
it automatically is an equivalence relation. We still must prove that it is reflexive, 
symmetric and transitive. 


Definition. If R is a relation on the set A anda € A then the RELATION CLASS of 
a with respect to R, written [a] p, is 


[a]p = {b € A| aRD}. 


If R is the relation “congruent modulo n” and a € Z we will write [a],, to be the 
CONGRUENT CLASS of a modulo n. If the relation being used is obvious, we 
may write [a] instead of [a],, where n is a positive integer. 


We have already done examples which involve the congruent class. In particular, 
Example 62A shows that 5 is in the congruent class of 17 modulo 6 (or, in symbols, 
5 € [17], and 17 € [5],). Example 62B is the statement that —1 ¢ [23],,. On the 
other hand, Example 63A gives a complete description of the congruent class with 
respect to “related modulo 2.” There are two different congruent classes, one is the 
set of even integers and the other is the set of odd integers. In symbols 


Z, = odd integers U even integers (3.1.2) 
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or 
Z = [0], V [1], = [8 U [7]. - (3.1.3) 


The equality (3.1.3) is a bit subtle. We are not asserting that 0 = 8 or that 1 = 
—7, only that [0], = [8], = {even integers} and [1], = [—7], = {odd integers}. 
These last statements become 0 ~2 8 and 1 ~2 —7. Notice that there are two different 
congruent classes; they are either disjoint or equal and their union is Z. 


Example 65. Let R be the norm relation defined earlier. If (xo, yo) € R* andr = 
4/ xe + ve then the relation class of (xo, yo) with respect to ~ is a circle of radius r 
about the origin unless r = 0, in which case [(0,0)]p = {(0, 0)}. 


Solution. Let (xo, yo) €?. Then 
[(xo, yo]. = {(%, y) ER? | x? + y? = x9 + Yo}. 


Thus [(xo, yo)lz = {(x, y) € R? | x? + y? =r}. The right side of the last equation 
is exactly the equation of a circle with radius r about the origin unless r = 0. If 
r = 0, there is exactly one point (x, y) such that x? + y? = 0 and that is the origin. 
// 


Notice that, in this case, any two relation classes are either disjoint or equal and 
the union of all of them is the original set R?. Is that situation always true for any 
relation on a set A? It is if ~ is an equivalence relation, as we will see in important 
Theorem 13. On the other hand, this will not be true for every relation. 


Exercise 37. Show that for the “greater than or equal to” relation on R of 
Exercise 35, that 


A. [0] is the non-negative real numbers. 
B. Show that there are a,b € R such that [a] ¥ [b] but [a] N [b] 4 9. 


Exercise 37B shows that, for some relations, it may not be true that relation 
classes are either disjoint or equal. 


Definition. If ~ is an equivalence relation on A anda € A then we call [a]. the 
EQUIVALENCE CLASS of a with respect to ~. Note that every equivalence class is 
a relation class. Elements in the same equivalence class are said to be EQUIVALENT 
(under R). We’ll drop the subscript ~ on the equivalence class symbol if it’s clear 
which equivalence relation is being used. 


Exercise 38. Using the equivalence modulo 3 relation or “congruent modulo 3”, 
what is 

A. [0] B. [1] C. [2] 

D. [3] E. [4] F. [5] 

G. [—1] H. [-2] I. [-3] 

Hint: The answer to each part is a set. In fact, each is a subset of Z. We have written 
[0], for example, to mean [0]. 
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One of the ways for setting up rigorous foundations for clock arithmetic and 
fractions is contained in the next theorem. This result shows that an equivalence 
relation on A breaks A into a collection of subsets in which any two elements of 
any given equivalence class are related and no element is in 2 distinct subsets. This 
result (Theorem 12 ) is anticipated by Examples 63A and 65. 


Theorem 12. /[f A is a set and ~ is an equivalence relation on A, and a and b are 
elements of A then either [a] M [b] = or [a] = [5]. 


Proof. The logical chain of this proof is to assume that [a] M [b] is not the empty set 
and then conclude that [a] = [5]. If [a] N [b] 4 9, there is an element c of A with 
c € [a] andc ¢€ [b]. Thus 


a~candcw~b. (3.1.4) 
From transitivity, (3.1.4) shows that 
a~b (3.1.5) 


and we will use that fact to show that the two sets [a] and [b] are equal. 

To demonstrate that two sets are equal we need to show that every element of the 
first is a member of the second and every element of the second is a member of the 
first. 

We will start with d € [a], an arbitrary element and show that d is also an 
element of [b]. Since d € [a], a~ d, but (3.1.5) says that a ~ b so, by transitivity 
and symmetry, b ~ d. Thus d is related to b andd ¢€ [b]. A similar argument shows 
that if m € [b] then m € [a] and so the proof is established. | 


Another way of stating this result is two equivalence classes are either equal or 
they have no elements in common; that is, the equivalence classes are DISTINCT. 


Theorem 13. Jf A is a set ~ and is an equivalence relation on A, then A can be 
written as the union of equivalence classes in which no two distinct classes overlap. 


Proof. Since A is the union of all equivalence classes (because any element is 
equivalent to itself), the theorem follows immediately from Theorem 12. a 


Problem Set 3.1: 


1. What is: 


(a) The set of points related by the absolute value relation to 0? 
(b) The set of points related by the absolute value relation to 1? 


2. (a) The set of points related by the norm relation to (1,0)? 
(b) The set of points related by the norm relation to (—1, 0)? 
(c) The set of points related by the norm relation to (0, 0)? 


3.2 


1S) 


Nn 


10. 
. If ZF is the set of all triangles in the plane and R is defined for T; and T» 


12. 


13. 
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. Show that any two odd integers are congruent modulo 2. 
. For the equivalence relations on Z given by =, find the number of distinct 


equivalence classes if 


A. n=3 
B.n=4 
C.n=12 


D. n arbitrary 


. Using the relation =, what are other names for [0] and [1]? 
. Using the relation =12 is there anything in the room you are in which looks like 


the collection {[0] , [1], [2],..., [11]}? Hint: Is there a clock on the wall? 
Definition. For a,b € R, we define the LESS THAN RELATION by 


A~tess b ifa <b 


. Answer the following questions for the less than relation. 


A. What is the relation class of 1? 

B. What is the relation class of 2? 

C. Is Theorem 12 true for ~jes;? Why? 
D. Is ~tess an equivalence relation? Why? 


. Let Y be the set of all lines in R? (the plane). We say that for £), £2 € ZY, £ 


is “parallel related” to 2 if £; and £2 are parallel. Is being parallel a symmetric, 
reflexive, or transitive relation? (Hint: Look up the definition of parallel lines 
from a geometry book). 


. Is the “norm relation” from the definition at the beginning of this section an 


equivalence relation? Why? 
If the relation R on the set A is reflexive, prove that, for any a € A,a € [a]p. 


as T,| ~,; T> if the angle measures of 7| are equal to the corresponding angle 
measures of 7, show that ~, is an equivalence relation. What do you think the 
letter s might stand for in the symbol ~,? 

Suppose that a,b € Z. Define a relation R is by aRb if a* —b? is divisible by 9. 
Is R an equivalence relation of Z? 

If OQ is the relation of R? given by O((x Yi); (2, y2)) is true if x; = x2 


(a) Prove that Q is an equivalence relation. 
(b) What are the equivalence classes for 0? 


3.2 Modular (Clock) Arithmetic Through 


Equivalence Relations 


Overview In the last section, we gave the general definition of equivalence 
relations on a set A, showed how relations determine equivalence classes, and how 
the equivalence classes partition the set A into a collection of subsets which were 
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either identical or pairwise disjoint (Theorem 13). We will now present, in the next 
two sections, two striking examples of the use of equivalence relations. This section 
provides a rigorous foundation for modular or clock arithmetic and the next one 
provides the rigorous basis for fractions. 

We will devote this section to the example of the integers Z with the equivalent 
modulo n and visualize the equivalence classes as a clock. Working with the integers 
and the “equivalent modulo n” is called MODULAR ARITHMETIC. 

We will use the usual clock (using equivalent modulo 12) as an example during 
the section. With the experience you have from the preceding section, you will be 
able to show the following 


Example 66. Letn = 12. Each equivalence class (under equivalence modulo 12) is 
either 


[0] [1]. [2]. [3].-.-, 0], 111] 


where we have dropped the subscript 12. 


Solution. From Theorem 12, we need to show that any two equivalence classes are 
equal or disjoint. So we really need only show that any integer is in one of these 
classes. We carried out the process for n = 3 in Example 63B and the procedure is 
similar. 

Let a € Z. The Division Algorithm shows that there are g,r € Z such that 


a= 12q +r where0 <r < 12. 


Since g € Zanda—r = 12¢ is divisible by 12, a =). r. The proof is complete 
because the remainder r lies between 0 and 11 (inclusively). // 


This result allows us to picture the collection of equivalence classes (mod 12) as 
a clock. (We could write [12] instead of [0] as they are equal sets but it is better to 
use [0] as indicated by the division algorithm, and soon we’ll see why.) We are thus 
led to make the following (Fig. 3.1). 


Definition. The set of equivalence classes of Z with respect to the “equivalence 
relation modulo n” is called the INTEGERS MODULO n and is written Z,. 


Notation. Z,, has n elements: equivalence classes of 0, 1,...,2—1. To simplify the 
notation, we shall write k to represent the equivalence class [k], under ~,, of the 
integer k since n is fixed. Of course, k =k +nsincek~, k +n. We continue to 
write a = b modn fora~, b. 


Fig. 3.1 Usual 12-clock vi 
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Fig. 3.2. The n-clock 


A clock with n numerals, labeled 0,1,...,7 — I, can be thought of as a physical 
representation of Z,. Figure 3.2 has such a clock (and you’ve seen many with 
n = 12 although 12 (not zero) is used on the clock’s face). 

The point of these definitions is that we can put clock arithmetic on a rigorous 
foundation; that is, we can now add and multiply clock numerals. Here is an example 
on the 12-clock. 

When we ask people what time it is 5h after 11, they will either count it on a 
clock or their fingers (having 12 fingers would make that easier!) or occasionally 
say that “5 + 11 = 16 but we are working on a 12h clock so 16 is really 4.” If 
pressed, they may actually say that 4 comes about because 16 — 12 = 4. What these 
folks are really doing is modular arithmetic with n = 12. 

In our language, with n = 12, we are asking them to add 5 to [11] (not 11) and 
they are thinking [11] + [5] = [16] = [4] where the last equality holds because 
16~,24 or 16 = 4mod 12. 

How would we guess to multiply on a 12h clock? What would, say, 3 times 
eleven o’clock be? We regard multiplying as repeated addition so 


3 x [11] = [11] + [11] + [11] = [33] = [9] 
since 33 = 9 mod 12. Thus, we might guess that we should define 
[3] x [11] = [33]. 


The outline above motivates the definitions we now write down. The definitions 
of multiplication and addition give the structure of arithmetic on a clock. 


Definition. If k and @ € Z,,, then the PRODUCT of k and £ is defined to be 
k@=ké = kl). (3.2.1) 
The suM of k and t, is defined to be 
kK+€=k+l=[k +4]. (3.2.2) 


The product and sum above define the MODULAR ARITHMETIC of Z,,. 
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Is there anything we must do with these definitions before we start to use them? 
Most would say “no”, but there really is! We'll use addition as an example. What is 
[2] + [3] when n = 4. The definition says that 2 + 3 = 5, or 


[2] + 3] = [2 +3] = [5]. (3.2.3) 


But (and this is the key point), there are many ways to represent 2 (or [2]). For 
example, both 2 and 6 represent the same equivalence class 2 since 2 € 2 and 6 € 2. 
Thus we could have written 


[6] + [3] = [6+ 3] = [9] (3.2.4) 


instead of (3.2.3). In order for addition to be WELL DEFINED, it must be that the 
answers from (3.2.3) and (3.2.4) must be the same, and they are. After all, although 
9 and 5 are different integers (5 4 9), the question is whether the set [5] is equal to 
the set [9] or not under the equivalent modulo 4 (not whether 5 = 9). Since 9—Sisa 
multiple of 4, 9 = 5 mod 4 and so they both define the same equivalence class, thus 
[9] = [5]. At least in this case, addition is WELL DEFINED, that is, the operation for 
addition does not depend on the representation chosen for the equivalence classes. 
But, we must prove that both addition and multiplication are well defined for all k 
and £. The definition of “well defined” is contained in the statement of Proposition 8. 
Roughly, it says that addition (or multiplication) will give you the same equivalence 
class no matter which representatives you use to add (or multiply). 


Proposition 8. For the integers modulo n on Z, if |k1| = |k2] and [€1] = [€2] then 


A. (ky +1) =n (ko +2) and thus [k,; + €,| = [kz + £2] (addition is well defined). 
B. (k, £1) =n (K22) and thus [k, £1] = [k2£.| (multiplication is well defined). 


Proof. We leave Part B as Problem 2 of Problem Set 3.2. For Part A, since [A,] = 
[k2], k1 =n kz and so there is an integer p with k,; — ky = pn. Similarly, there is an 
integer q with 0; — 0) = qn. Thus 


(ki + £1) — (ko + £2) = (p+ q)n 


and therefore k; + €; and ky + €, differ by a multiple of n. We conclude from 
the definition of integers modulo n that (ky + €2) =n, (kz + £2) and so those two 
elements determine the same equivalence class, that is, [k, + €;] = [k2+ 2]. 


We may now use Eqs. (3.2.1) and (3.2.2) to define addition and multiplication 
modulo 7 because Proposition 8 says that the ambiguity of those equations (picking 
the representative of the equivalence class) does not matter in the end. 

What is fascinating is to watch children work with clock arithmetic. They will 
instinctively realize that 10 + 3 = 13 = 1 meaning for us that 10+ 3 = 13 = 1 
in a 12h clock and so clock (or modular) arithmetic is intuitive. If you introduce a 
5-clock (1 = 5) or an 8-clock (n = 8), they will adapt immediately. 
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We have learned that systems in which we can add and multiply have other 
properties, such as associative, commutative, and distributive laws. These properties 
carry over immediately to modular arithmetic, (although others may not). There is 
also an analogy to 0 and 1 in modular arithmetic as you will show in Problems 6 
and 7 at the end of this section. 


Notation. We will use “zero in Z,,” to mean the equivalence class of 0 and “one in 
Z,,” the equivalence class of 1. 


Definition. MODULAR ARITHMETIC, or ARITHMETIC MODULO 1, is the set Z,, 
together with addition and multiplication as defined in Eqs. (3.2.1) and (3.2.2). The 
ZERO ELEMENT of Z, is 0 and the ONE ELEMENT of Z,, is 1. 


We will now examine whether the usual rules of arithmetic (for the integers Z) 
are also obeyed in modular arithmetic. The differences between Z,, and Z often can 
make for some exciting classes in middle and elementary school. Of course, for any 
two integers a,b € Zif ab = 0 then one (or both) are zero. That statement need not 
be true in modular arithmetic. For example, if n = 12 then (2)(6) = 0 but 2 4 0 
and 6 0. Thus it is possible for there to be elements a,b € Z,, with ab = 0 but 
a # Oandb ¥ 0. When does this happen? 


Proposition 9. [fn > 1 is not a prime number then in Z, there are non-zero 
elements k and £ with kt = 0. 


Proof. Since n is not prime, there are integers k and € with k£ = n and neither k 
nor € is equal to n. Thus ké = [k] [¢] = [ké] = [n] = [0] = 0 Since k and £ are 
between 0 and n, neither of them is a multiple of n and sok # O and @ ¥ 0. a 


The converse is also true. 


Proposition 10. Suppose that for a givenn > 1, Z, has the property P: 
k and € Z, and k@ = 0, then eitherk = 0 orl = 0. 


If property P holds for all k and £, then n is a prime number. 


Proof. We will assume n has property P and must prove that 7 is prime. Suppose 
that n = ab where a,b € Z. Usually we have shown that either a = 1 orb = 1. In 
this case, we will do what is logically the same, show that either a = n or b = n. 
Since n = ab then [n] = [a] [b] but [n] = 0, so 


0=ab. 
Property P says that in this case either @ = 0 or b = 0. If @ = O thena isa 


multiple of n which means that a = n since ab = n and both a and D are integers. 
The same logic applies to b. Thus n must be a prime number. a 
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What we’ve seen is the fascinating result that in n-clock arithmetic it is possible 
that two non-zero “numbers” can multiply to “zero” exactly when n is a composite 
number (that is, not prime). Try it yourself with n = 6. 

We will present division in Z,, in a series of Exercises at the end of this section, 
separating carefully out two different cases, one in which n is a prime number and 
the other in which n is a composite number. 

We will now give an example of how an equivalence relation can be used to 
describe a geometric shape (a circle). It gives us an opportunity to see sets and 
equivalence relations applied to topics (like geometry) other that those in number 
theory. In fact, the study of rotations and translations can also be phrased in the 
language of equivalence classes. As an aside, we may define two triangles as being 
CONGRUENT if there is a rotation or a translation of the plane or a combination of 
these which carries one triangle to the other. The definition of congruence is the 
same concept as the idea of congruence of triangles from middle and high schools. 


Definition. Let R be the real numbers and put the relation 
ri~zrifr; —m2€Z 


on R. This relationship is called EQUALS MOD Z or EQUALITY MODULO Z or 
EQUALITY MOD 1. 


The set of equivalence classes mod Z is usually written R/Z and read “IR mod Z”. 
The next two exercises describe all the equivalence classes of R/Z. 

How can we visualize the set of equivalence classes mod | (called “equivalence 
classes mod 1”), R/Z? It is tempting to think of R/Z as the interval between 0 and 
1, but that is incorrect because of [0] and [1]. Put another way, what does the set of 
equivalence classes mod 1 look like? For each real number between 0 and | there 
is a unique equivalence class and every equivalence class is represented except the 
equivalence class of 0 (see Problems 10 and 11). 

Unfortunately, we can’t identify the set of equivalences with the numbers 
between 0 and | inclusive because 0 and | represent the same element of the 
equivalence classes mod | because 0 and | give the same equivalence class, which is 
Z (Problem 10). Note that if r > 0 and r > 0 then [r] approaches [0] = Z (loosely 
speaking). 


=—_r S = 


Note that if s < 1 and s — 1 then [s] — [1] = Z. Thus we may visualize the 
equivalence classes mod | as the interval 0 < r < 1 but with the endpoints (0 and 
1) identified. Try this construction with a string yourself and you get a circle with 
circumference |. 

The result of the identification is that R/Z “looks like” a circle. This little 
example shows how the properties of numbers can turn into questions and problems 
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of geometry (or topology to give the above figure a different field). We’ve just 
shown a way of constructing the circle by using equivalence classes (of course, 
there are many other ways). Relations, especially equivalence relations, permeate 
modern mathematics. As a thought problem, how would you define addition or 
multiplication on the set of equivalence classes mod Z (the circle)? Prove that your 
definition is well defined. 


nAbBwWNh 


Problem Set 3.2: 


. Do the following clock arithmetic. 


(a) Ona 5-clock, what is 3h after [2]? What is 8h after [2]? 

(b) Ona 12-clock, what is 5h before 3 (Mathematically, what is [3] — [5])? 

(c) How many times can you add [8] to itself on a 12-clock to get the zero in 
Zy2, or is this impossible? 


. Show that in modular arithmetic 0 is the set of multiples of 7. 

. Prove that multiplication in Z,, is well defined (that is, prove Proposition 8B). 

. Prove, in modular arithmetic, that (@ + b ye =act+ be. 

. In this exercise, we define subtraction in Z,, and ask you to prove that it is well- 


defined. If k and @ are elements of Zn, we define 


k-L=k—. 


(a) Suppose that kj = ky and £; = £. Show that k, — £ is equivalent to 
ky — Lo. 
(b) Conclude from Part A that k, — 2; = ky — £o. 


Compare your proof with that of Proposition 8. 


. In modular arithmetic, show that the only element of Z that “looks like” the 


additive identity is 0. More precisely, prove that if 


x+a=dforallae€ Z, 


then x = 0. 


. In modular arithmetic, show that the only element of Z,, that “looks like” the 


multiplicative identity is 1. More precisely, prove that if 
xa=dforalla€ Z, 


then x = I. 


. Show that in Z)> 


x + [12] = X for all ¥ € Zy2. 


Does this contradict Problem 3? 


. Prove that equality mod 1 is an equivalence relation on R. 
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10. Show that [0] = Z and [1] = Z. Inequality mod Z. 
11. Ifr € R then, inequality mod 1, 


(a) [r] = Zor 
(b) there is an s with 0 < s < 1 such that [r] = [s]. 


Problem Set 3.2: Division 


The purpose of these exercises is the explore the notion of division in Z,,. There 
is a significant difference in the results that follow depending on whether 7 is a 
prime number or not. We first give two examples (n = 5 and n = 12) to motivate 
the definitions and then deal with the general theory. 


1 Div. On a 5-clock (5 is prime), show that for every ¥ € Zs, except [0], we can 
add x to itself enough times to get 1 (Hint: list all of the elements of Zs and 
try it individually). 

2 Div. Ona 12-clock (12 is not prime), is it true that for every X € Z)2, we can add 
X to itself enough times to get 1? For which X can we and for which can we 
not? 


Definition. If x € Z,, then we say that x has a MULTIPLICATIVE INVERSE if there 
isa y € Z, such that x y = 1. 


You have shown that every non-zero element of Zs has a multiplicative inverse 
but that this is not true for Z)>. 


3 Div. Analogous to Problem | Div, show that if 1 is a prime number, every non- 
zero element X has a multiplicative inverse in Z,. 

4 Div. Analogous to Problem 2 Div, show that if is not a prime number, then there 
exists ax € Z, such that x 4 0 and X does not have a multiplicative inverse. 

5 Div. Let n be a prime number and @,b € Z, with @ # 0. Prove that there is one 
and only one X € Z, such that 7 ¥ = b. (That is the linear equation can be 
solved.) 

6 Div. Let n be a composite number. Show that there exists a, be Zn With 
a # 0 such that there does not exist a ¥ with @ ¥ = b. (Hint: Look at 
Problem 4 Div.) 


3.3 Fractions Through Equivalence Relations 


One of the truly subtle concepts of the mathematics of late elementary school 
and early middle school is that of fractions. In most books for future teachers the 
definition of a fraction is given as a “part of the whole” and operations on fractions 
given as rules and motivated by representations of fractions by colored regions. 
(pizza model), fraction strips, set model, or the number-line model, for example 
(see Chapter 6, Long et al. [6]). The criteria for a successful representation of a 
fraction is a clear answer to each of the following questions: 
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¢ What is a unit? 
¢ Into how many equal parts has the unit been subdivided? 
¢ How many of these parts are under consideration? 


For the fraction ¢, b is the answer to the second bullet and a is the answer to the 


third bullet. : 

We believe very strongly in the intuitive powers of pictures, connections, and 
representations. However, there also needs to be a logically consistent basis for 
any part of mathematics and the concept of fractions should be no exception. A 
good overview of how mathematicians present fractions to future elementary school 
teachers is contained in McCrory (2006) (See [7]). Ultimately each approach is 
circular or assumes some geometric properties of the line. We are not arguing that 
fractions should be taught to either future elementary teachers or to the children 
using equivalence relations as presented in this section. Rather, this section is 
intended for those who need to understand deeply the logical foundations on which 
the arithmetic of fractions lies. Those people include math specialists in elementary 
school, future middle or high school teachers, mathematically sophisticated high 
school students, or math majors. 

We now define a set F and an equivalence relation on F’. The equivalence classes 
of F (with regard to this relation) will give a precise definition of what a fraction 
is. We then proceed to show that the usual definitions of addition and multiplication 
of fractions are well-defined in this rigorous treatment. Our approach will follow 
the logical argument of the previous section for adding and multiplying in modular 
arithmetic. We’ll see immediately (in Eq. (3.3.1)) that the “equivalent as fractions” 
relation mimics the usual statement that the fractions o = hk if and only if ajb2 = 
azb,. We’ll also see that, just as in modular arithmetic, we will need to show that 
each operation on fractions is WELL DEFINED. 


Definition. Let F be the set 
F = {(a,b) | a and b are integers and b 4 0} 


If @ = (a), b,) and B = (a2, b2) are elements of F then we say that w and f are 
EQUIVALENT AS FRACTIONS if 


ab = ab; (3.3.1) 


and write a~r B 
Proposition 11. The relation ~r is an equivalence relation on the set F. 


Proof. It is easy to see that ~- is both reflexive and symmetric. To show that ~ F is 
transitive assume that a = (a), b)), B = (a2, b2) and y = (a3, b3) are elements of 
F witha ~-r B and B ~f y. We have 


ab => arb and arb; => a3b2. 
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We must show that wa ~ y which is the same as a3b; = a,b3. Since by 4 0, we 
may use the previous equalities to obtain 


= a,b3. 


_ ayb3 = (azb,)b3 _ (a,b2)b3 
b | lh by 


Thus a ~F y by the definition of equivalent as fractions. a 


Definition. The SET OF FRACTIONS Q is the set of equivalence classes of F' with 
respect to the equivalence relation ~. It is written as 


Q=F/~F 


It is important to remember that an element of Q is actually a set of pairs of integers 
(where the second integer isn’t zero). A FRACTION is an element of Q. This means 
that a fraction, in our approach, is a set of pairs of integers in which the second 
coordinate is never zero. 

Of course, this is a very formal definition of fraction, depending only on the 
construction of the set of integers, Z, and some elementary set theory. How does 
this construction relate the definition of fractions from elementary school? 


Notation. Throughout the remainder of this section, [a] will mean the equivalence 
class of a with respect to ~F. 


If a = (a,b) € F then the fraction ¢ is the equivalence class of (a,b). Thus 
+ = [(a,b)] € Q (note that b cannot be zero because of the way the set F' is 
defined). 

The next example may be difficult at first, but understanding it thoroughly will 


make the rest of the material in this section easier. 
Example 67. Let a = (1,2) € F . What does [a] look like? 
Solution. If 6 € [a] and 6 = (a,b) then (1,2) ~¢ B so that (3.3.1) is 


1-b=a-2orb=2a. 
In other words, there are infinitely many pairs in [a] such as 
(—2, —4), (-1, —2), (1, 2), (2, 4), (3, 6), (4, 8),... € [(, 2)]. (3.3.2) 


In the language of children, each of these are the fraction written as 5. // 


Notation. If (a,b) € F then the fraction ¢ is the fraction equivalence class of 


(a,b). Thus ¢ = [(a,b)] € Q. (Note that b cannot be zero because of the way in 


which the set F is defined.) 


Example 68. Show that > = 5=7=2= 3 
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Solution. 5 is the set [(1, 2)] which includes (see (3.3.2)) the elements (—1, —2), 
(2,4), (3, 6), (4, 8), etc. Because of Theorem 12, since (2,4) € [(1, 2)], it must be 
that [(2, 4)] = [(1, 2)]. Each of the other fraction equalities are proven in the same 
way. // 


Good news: our formalism actually shows that a fraction can be represented by 
many different pairs of numbers, just as in grade school! 

Before we prove the next result, let’s look at what is involved in showing that two 
fractions are equal. Since a fraction is a set, we should prove that each of the pairs 
are in the same set. Fortunately, thanks to Theorem 12, we only need to prove that 
there is one element in one of the sets which is also in the other and that is enough. 
Now would be a good time to review the consequences of Theorem 12. 

Here’s another result which is very familiar to you and a key to the conceptual 
understanding of fractions. 


Proposition 12. [fn is a non-zero integer and ¢ is a fraction then 


a na 
ic op (3.3.3) 


Proof. Given § € Q, we can write it as $ = [(a,b)]. (Of course, we could have 


chosen many other representatives of the equivalence class 7.) Let 
B = (na,nb) € F. 


Proving this proposition means answer the question, is 6 fraction equivalent to 
(a, b)? This question can be restated as asking if 


(a,b) ~p(na,nb) = B. 
From the definition of ~f (see (3.3.1)) this happens exactly when 
a(nb) = (na)b, 


which is always true. Since (a, b) is fraction equivalent to (na,nb), they determine 


na 


the same equivalence class and hence the same fraction. Therefore — = “t. Hi 


We now turn our attention to operations with fractions, specifically addition 
and multiplication. As in the case of modular arithmetic, we must show that the 
definition we will use for addition and multiplication does not depend on the choice 
of the representative of the equivalence class. The usual way of defining addition 
and multiplication of fractions is 


a, a2 a\a2 
(3) (=) ~ dybp Pie 


ay m ay aby + arb, 
Dy by bib» 


(3.3.5) 
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where a1, b1, a2, bz are integers with b} # 0 and by ¥ 0. What is never explained 
in elementary books about arithmetic is why these definitions actually make sense. 
After all, ; and - are the same quantity. What would happen if I chose 5 for the first 
fraction and you chose 2, and I picked z and you 8 for the second? Of course, the 
answers (5) ( =) and (3) (8) are the same but there is actually something to prove. 
The issue is whether the operation of multiplication is WELL DEFINED. When 
doing modular arithmetic, we also needed to prove that addition and multiplication 
were well defined. We first ask you to do two exercises with specific fractions as 
preparation for the general result (the first of which is also left as an exercise). 


Exercise 39. Remember 5 is a set and so is 7 Consider 2 = [(2,3)] and i = 


[(1, 5)]. Pick any two elements a = (a;,b)) and a2 = (a2, bo) of 5 and any two 
elements of 61 = (ci, d1) and B2 = (c2, d) of i and show that 


A. (a,c, b}d)) ~F (@2€2, bod2) 
B. [(a1¢1, b1d;)| = a: 


In the language of elementary school, we’ve asked you to show that 


independent of what representatives of the equivalence classes are chosen. 


Theorem 14. Suppose (a,,b1), (a2, b2), (c1, d1), (€2, d2) are elements of F.. If 
(a1, b1) ~F (dz, ba) and (c1, di) ~F (C2, a2) (3.3.6) 
then 
(a,c, b}d)) ~p (a2C2, brd2) (3.3.7) 


and 


aid, + cb, dydz + bacz 


3.3.8 
bid F bods ee 


Proof. We'll leave the first result (3.3.7) as Exercise 40. We'll now prove (3.3.8). 
We know that, because the pairs are fraction equivalent, 


a,b = arb, and c\dz = c2d). (3.3.9) 
Thus the left hand side of (3.3.8) is 


ayd, + cb, _ bydo(a,d, + cb) _ (a, bz) (di dz) + (cid2)bi by 
bid, by bod, dy bi bod\ dy . 
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Using (3.3.9) we find 


aid; + cb) _ (dybi)di dz + (cod1)(b1b2) = byd)(a2d> + cab) 


bid; byb2d dz bi bod, dz 


which is exactly the same as the right hand side of (3.3.8). a 
Exercise 40. Under the assumptions of Theorem 14, prove Eq. (3.3.7). 


Theorem 14 now allows us to make the definition of multiplication and addition 
of fractions in a formally correct, unambiguous, and rigorous manner since the 
Theorem says that any two elements of F which are fraction related will give the 
same fraction equivalence class after addition and multiplication. 


Definition. If ; and I are two fractions then the PRODUCT and the SUM of the 
those fractions are given by 


and 


Problem Set 3.3: 


1. Remember that ; is a set and so is 2. Consider 3 = [(, 4)] and 2 = [(3,5)]. 
Pick two elements a = (a1, 5)) and a2 = (a2, b2) of ; and any two elements 
By = (ci, d)) and Bz = (c2, do) of 2 and show that: 

(a) (aici, bids) ~F (@2C2, bad) 
(b) [(aic1, b1d1)] = 5- 
In the language of elementary school, you have just shown that ()) = x. 


2. Remember that ‘ is a set and so is 3. Consider ‘ = [(1, 4)] and ; = [(3,4)]. 


Pick two elements a, = (a1, )) and a = (a, b2) of t and any two elements 
By = (cy, d)) and By = (C2, dz) of + and show that: 


(a) (a,c1, 61d) ~ F(a2€2, bod2) 
(b) [(arcy, bidy)] = %. 


3. Ifb £0 andc ¥ 0 and both are integers, show that 
[(0, b)] = [@, ¢)]. 


4. If b € Zis not zero, we will write 


0 = [(0,5)] = 


100 


10. 


LI 
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Prove that for any fraction 5, 
(@) 0+5=4 
(b) (0) (5) =0 


Note that the equation of Part (a) shows that 0 is an additive identity. It is easy 
to see that multiplication and addition are commutative and associative. 


. If a is non-zero, prove that 


. Leta # 0 and write 


1= [4] 


The fraction 1 is well-defined by Problem 5. Prove that 


c c 
(5)=§ 

d d 
for any fraction 5. (This equation shows that 1 is the MULTIPLICATIVE 


IDENTITY.) 
Let a and a be fractions. Show that 


(a) § + § = $+ § (addition is commutative). 


(b) (¢)(S) = ($)(£) (multiplication is commutative). 


Describe what it means for addition of fractions to be associative and prove 
your statement. 

Describe what it means for multiplication of fractions to be associative and 
prove your statement. 

Let ¢ be a fraction. Prove that there is a fraction x with 


a 
—-+x=0 
b 

if and only if x = >". This is the statement of existence and uniqueness of 


ADDITIVE INVERSES OF FRACTIONS. 
Let ¢ be a fraction with a # 0. Prove that there is a fraction x with 


(vt 


if and only if x = B This is the statement of existence and uniqueness of 
MULTIPLICATIVE INVERSES OF FRACTIONS. 
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12. Show that multiplication is distributive across addition for fractions. More 
formally, if a, b,c, d, e and f are integers with b #4 0,d # Oand f 4 0 


£642) -()(9+((4). 


3.4 Integers Modular n and Applications 


Overview In Sect.3.1 the notion of the congruence mod n is introduced. Con- 
gruence modulo n is a relation on the set of integers with a related to b if 
a = b (mod n). It is proved that congruence modulo n is an equivalence relation on 
the set of integers in Sect. 3.2. This section studies some properties and applications 
with congruence such as RSA cryptosystem, Universal Product Code (UPC), and 
International Standard Book Number (ISBN). 


Proposition 13. Assume that a = b (mod n) andc = d (mod n). Then 


A. a+c=b+d (modn) 
B. a—c =b-—d (modn) 
C. ac = bd (modn) 


Proof. By assumption n|(a — b) and n|(c — d). Thus n|(a — b) + (c — d) which 
implies that n|(a + c) — (b + d) which means that a + c = b + d (mod n). This 
proves A. Similarly it is easy to show B. To prove C, we note that n|(a — b) implies 
that n|c(a—b), so that n|(ac—bc), i.e., ac = bc (mod n). Similarly, since n|(c—d), 
we have n|b(c — d) which implies n|(bc — bd), i.e., bc = bd (mod n). Therefore 
ac = bd (mod n). a 


Remark. Congruence acts like equality in many ways. 
An elementary induction argument allows us to have 
Corollary 5. Assume that a, = b,(modn), a2 = bo (modn), +++, dn = 
bm (mod n). Then 
A, tdo 4+: + dm = by +b. +--+ + by (mod n) 
and 


142***Am = bi by+++dDy (mod n). 


Corollary 6. [fa = b (mod n), then a‘ = b* (mod n) for any positive integer k. 


Note: If a = b (modn), then ac = bc (modxn) for any integer c by Proposi- 
tion 13C. However, the converse is not true. As an example, 4-3 = 2-3 (mod 6), 
but 4 ¢ 2 (mod 6). 
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Proposition 14. [fac = bc (modn) and GCD(c,n) = 1, thena = b (mod n). 


Proof. By hypothesis, we have n|(ac — bc) and so n|(a —b)c. But GCD(c,n) = 1, 
thus n|(a — b), i.e., a = b (modn). a 


Example 69. 5-3 = 1-3 (mod 4) and GCD(3, 4) = 1 implies that 5 = 1 (mod 4). 
Example 70. What is the remainder when 2”° is divided by 41? 
Solution. Since 2° = —9 (mod 41), by Proposition 13 C we have 
2/0 — 2°.2° = (—9)(—9) = 81 = —1 (mod 41), 
and 
Of 9h = (11) = 1-aned 41), 


so that 27° = 1 (mod 41). Therefore the remainder is 1 when 2” is divided by 41.// 
Example 71. What is the remainder when 1997!° is divided by 19? 
Solution. Since 1997 = 2 (mod 19), by Proposition 13C we have 


1997! = 2! (mod 19). 
Now, 
2* = 16 = —3 (mod 19) and 28 = 2*. 24 = (—3)(—3) = 9 (mod 19). 
Thus 
28.2? = 9-4 = 36 = 17 (mod 19). 


Therefore, the remainder is 17. // 


3.4.1 RSA Cryptosystem 


Cryptography is the study of sending and receiving secret messages. The goal of 
cryptography is to send messages across a channel (computers, electronic media) 
so that only the intended receiver of the message can read it. This is done by 
enciphering the message in such a way that an unauthorized person will find it very 
difficult to infer the message. 

The message to be sent is called the plaintext. The message after transformation 
to a secret form is called the ciphertext. The plaintext and the ciphertext are 
both written in an alphabet, consisting of letters or characters (A,..., Z), digits, 
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punctuation markers, and blanks. A cryptosystem has two parts: encryption, the 
process of transforming a plaintext to a ciphertext, and decryption, the process 
of transforming from ciphertext back to a plaintext. The entire process can be 
summarized in the diagram: 


m -— encrypting > c —decrypting—> m (3.4.1) 
(plaintext) (ciphertext) (plaintext) 


The most widely known public-key cryptosystem is the RSA cryptosystem 
proposed by Rivest et al. [9]. We will need to define the Euler ¢-function. 


Definition. The EULER PHI FUNCTION (x), is the number of positive integers 
less than n that are relatively prime (or coprime) to n. 


Example 72. What is the Euler phi function #(12)? 


Solution. ¢(12) = 4, since 1, 5, 7, and 11 are the only integers that are positive, 
less than 12, and coprime to 12. // 


Algorithm: Key generation for RSA public-key cryptosystem (see Mollin [8]). 
Summary: Each entity creates an RSA public key and a corresponding private key. 
Each entity should do the following: 


1. Generate two large random (and distinct) primes p and q, each roughly the 
same size (about 100 digits). 

2. Compute n = pq and ¢(n) = (p—1)(q—1). ¢ depends on the two distinct 
primes p and q. 

3. Select a random integer e, 1 < e < $(n), such that GCD(e, d(n)) = 1. 

4. Use the Euclidean algorithm to compute the unique integer d,1 < d < 
o(n), such that ed = 1 (mod ¢(n)). 

5. A’s public key is (1, e); A’s private key is (d, p,q). 


(I 


Na 


Encryption. B encrypts a message m for A, which A decrypts. 
B should do the following: 


1. Obtain A’s public key (7, e). 

2. Represent the message as an integer m in the range {0,1,...,n-1}. 
3. Compute c = m* (mod n). 

4. Sendc to A. 


(ID) Decryption. A should do the following: 


1. Compute m = c4 (mod n),0 < m <n. 


Example 73. Let p = 3,q = 17. Thenn = pq = 51 and (nm) = 2-16 = 32. 
Since GCD(e = 11, ¢(”) = 32) = 1, we may choose e = 11 and hence d = 3. To 
encrypt message “G”=07, compute c = 7!! = (78)(77)(7) = 31 (mod 51). So Bob 
sends c = 31 to Alice. To decrypt the message using d = 3, one has m = 317 = 7 
(mod 51). Hence m = 07 = “G”. 
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Example 74. Let p = 29,gq = 53. Thenn = pq = 29-53 = 1537 and ¢(n) = 
28-52 = 1456. Since GCD(47, 1456) = 1, we may choose e = 47 and hence 
d = 31.d = 31 can be computed by the Euclidean algorithm as follows. 
Since GCD(47, 1456) = 1, we calculate 

1456 = 30-47 + 46 


47=1-46+1 
Thus, 


1=47-1-46 
= 47—1- (1456 — 30-47) 
= 31-47— 1456. 
To encrypt message “HI”=0809, c = 809*7 = 1439 (mod 1537). So Bob sends 
c = 1439 to Alice. To decrypt the message using d = 31, Alice has 
m = 1439°! = (—98)*! (mod 1537) 
= (—1)"1(98"') (mod 1537) 
= (—1)*!(728) (mod 1537) 
= —728 = 809 (mod 1537) 


Hence the message m = 809= “HI”. 


Example 75. Let p = 43,gq = 47. Thenn = pq = 43-47 = 2021 and ¢(n) = 
42-46 = 1932. Since GCD(e = 61, ¢(n) = 1932) = 1, we may choose e = 61 and 
hence d = 1837. To encrypt message “BE”=0205, c = 205°! = 1422 (mod 2021). 
So Bob sends c = 1422 to Alice. To decrypt the message using d = 1837, Alice 
has 


m = 1422'897 = (—115)!897 = 205 (mod 2021) 


Hence the message m = 205= “BE”. 


Exercise 41. Suppose Alice’s public key is (n,e) = (91, 11). If you want to send 
a message containing the single letter P (16), what ciphertext does she receive, and 
how does Alice decipher it? 
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3.4.2; UPC and ISBN (See Gallin and Winters [3], Rosen [10]) 


(1). Universal product code (UPC) is now found on most product in grocery 
and retail stores. The UPC has 12 digits. The first six digits identify the 
manufacturer, the next five identify the product, and the last is a check digit 


(see figure). 
0 | | 2 


If djd>-++d 2 is a valid UPC code, then 
3d, + dy + 3d3 +. d4+-+--+3d1; + dj. =0 (mod 10). (3.4.2) 


Example 76. 1s 0 — 21000 — 65897 — 8 a valid UPC? 


Solution. To verify it, we calculate 


3-04+1-24+3-14+1-043-04+1-04+3-64+1-54+3-84+1-94+3-741-8 
=24+34 18454 244+94214+8=54+84+54+44+941+4+8 
=3+9+0+8=0 (mod 10) 


Thus, it is a valid UPC. // 


Example 77. Find the missing digit of UPC 021000x58798. 


Solution. We calculate 
3-0+ 1-2+3-1+1-0+3-0+1-0+3-x+1-5+3-8+1-7+3-9+1-8 = 3x+6 = 0 (mod 10). 


Therefore, 3x + 6 = 0 (mod 10). Thus x = 8. // 
Exercise 42. Find the correct check digit of the UPC code with 070330201 18 as its 
first 11 digits. 

dI). ISBN—10 


Since 1972, a book published anywhere in the world has carried a ten-digit code 
called an International Standard Book Number (ISBN). An ISBN consists of four 
parts: a group code, a publisher code, an identifying number assigned by the 
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publisher, and a check digit. For example, in the ISBN 0-201-87073-8, the group 
code (0) denotes that the book was published in an English speaking country 
(Australia, Canada, New Zealand, South Africa, the United Kingdom, the USA). 
The next group (201) denotes the publisher, and the third group (87073) denotes 
this particular book among all those published by that publisher. The final digit (8) 
is the check digit, which is used to detect errors in copying or transmitting the ISBN. 

The check digit has 11 possible values: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, or X (XK 
represents the number 10). If dd --- do is a valid ISBN, then 


10d, + 9dz + 8d3 + 7d4 + 6d5 + 5d + 4d7 + 3dg + 2do+ d\yp = 0 (mod 11). 


Example 78. 1s 0 — 534 — 91500 — 0a valid ISBN? 


Solution. We calculate 


10-0+9-548-34+7-446-94+5-144-543-04+2-041-0 
= 454244 2845445420 
=14+2+4+64+104+5+4+9 
= 33 = 0 (mod 11). 


Hence, it is a valid ISBN. // 
Example 79. Find the missing digit of ISBN 0 — 67x — 98039 — 1. 


Solution. Again we calculate 


10:04+9-64+8-74+7-x+6-94+5-84+4-043-342-9+1-1 
= 54+ 564+ 7x +544+40+9+4+ 184+ 1 
=104+14+7x+10+74+94+7+1 
=114+7x+11+ 23 
= 7x + 1=0 (mod 11) 
Hence 7x + 1 = O (mod 11) implies 7x = —1 = 10 (mod 11). Hence x = 3 
(since 7-3 = 21 = 10 (mod 11)). Thus the missing digit is 3. // 
Exercise 43. Find the correct check digit of the ISBN number with 0-673-48582 as 


its first nine digits. 
Problem Set 3.4: 


1. What is (a) 6!°° (mod 7) (b) 837° (mod 7) (c) 25°8 (mod 7)? 
2. What is (a) 935 (mod 13) (6) 40255 (mod 13) (c) 1997" (mod 19)? 


3.4 


lon 


10. 
11. 
12. 
13. 


14. 
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. Letn € N. Show that n = 0 (mod 3) if and only if the digit sum of n is 


divisible by 3. 


. Determine whether each of the following pairs is congruent modulo 9. 


(a) 1,19 (b)—1,8 (c)—9,52. 


. For which positive integers 7 is each of the following true? 


(a) 28 = 4 (mod n) 
(b) 152 = 142 (mod n) 


. Show that a” = 0,1 (mod 4) for alla € Z. 
. Show that ifn is an odd positive integer, then 1 +2+3+---+(n-—1) =0 


(mod n). 


. Find the least positive residues mod 13 of each of the following integers. 


(a) 930 
(b) 253 


. Determine whether each of the following twelve digit strings can be the UPC 


of a product 


(a) 044000045548 
(b) 031604028465 


Find the missing digit of UPC 021000758798 

Find the missing digit of ISBN-10 0 — 54—? — 16509 — 9 

We are using the RSA system. Suppose I chose (n,e) = (77,5), (77,7). If 
you want to send a message “Z” to me, what ciphertext do you actually send? 
Explain which (n,e) you pick and why. 

We are using the RSA system. Suppose my (n,e)= (33, 13) and I receive from 
you a ciphertext message 22. How do I decipher it? 

We are using the RSA system. Let p= 97, q= 103. Suppose e= 13,19. Then 
what is the cipherkey, respectively? 


Part II 
The Algebra of Polynomials 
and Linear Systems 


Chapter 4 
Polynomials and the Division Algorithm 


Introduction At this point, you are quite adept with algebra and, especially, 
polynomials. However, in this chapter, we will examine polynomials with four goals 
in mind that are very different from your previous study: 


(a) to give the subject a formal consistency (for example, careful definitions of ideas 
and operations); 

(b) to view a polynomial in two different ways such as an indeterminate form or 
special kind of function in one variable; 

(c) to develop the analogy between the set of numbers and the set of all poly- 
nomials. 


4.1 Addition and Multiplication of Polynomials 


Overview In this section we carefully define what a polynomial is and formally 
define equality, addition and multiplication of polynomials. These operations allow 
us to create new polynomials from previously known examples in a similar 
manner to show how we create new integers or real numbers using addition 
and multiplication of previously known integers or real numbers. The similarities 
between the set of polynomials and the set of integers will direct us in our study. For 
the sake of completeness, we include the following definition. 

A student’s notion of what a function is changes throughout schooling. At first, 
it’s just an assignment, then it becomes a “general rule” f for assigning to each 
number in its domain another number, and ultimately f(x) is a way to give to each 
point x in a set (the domain of f) a value, f(x), in another set (the range of /). The 
formal symbol x can be thought of as an arbitrary or typical value of the variable, 
or x can be thought of as a quantity which is not determined (an indeterminate). 
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Definition. Let n be a non-negative integer and do,...,a, be real numbers such 
that a, ~ 0. An expression of the form 


P(X) = anx" + nix ape 2x” + a\x + ao 


is called a POLYNOMIAL in the indeterminate x with real numbers as COEFFI- 
CIENTS such as do, @1, ...,d,. The coefficient dp is called the CONSTANT TERM 
and a, is called the LEADING COEFFICIENT of p(x). If a, = 1, then p(x) is called 
a MONIC polynomial. 

For the sake of completeness, we add the special case where all of the coefficients 
are equal to zero and we say p(x) is the ZERO POLYNOMIAL. A polynomial 
in which all coefficients are zero except possibly for the constant term is called 
a CONSTANT POLYNOMIAL. Occasionally, we may emphasize that p(x) is a 
polynomial over the real numbers R. 


Definition. If 1 is the largest non-negative integer such that a, 4 0, then we say 
that the polynomial p(x) has DEGREE n, and write deg(p(x)) = n. The zero 
polynomial has no degree. 


Definition. Two polynomials are EQUAL when they have the same degree and their 
corresponding coefficients are equal. More precisely, if 


Pi(X) = dy x" + dy—1x" | + +++ Fax + ao 
and 
pr(x) = by x" + dyx" | +++ + dix + do, 


then p;(x) = po(x) if and only if a9 = bo, ay = by, ..., and dy = by. 


Example 80. If (a—b +5)x? + (3a +2b—11)x + (c —3) is the zero polynomial, 
find the values of a, b, and c. 


Solution. By the definition of the zero polynomial, each coefficient must be zero. 
Thus 


3a-—b+5=0, 3a+2b—11=0, andc—3=0. 


Solving these three equations for a, b, and c gives: 


1 16 
a=- De ED 


// 
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Example 81. Let p(x) = ax* + bx? + cx? + dx +e be a polynomial with 
a,b,c,d,e € Z and assume 


3\a| + 4|b — 2| + Jc-—1) = 1. (4.1.1) 


What are the possibilities for p(x)? 


Solution. Since the right hand side of Eq. (4.1.1) is 1 and each of a, b, c, d, e are 
integers, there is a limited number of possibilities and we proceed by inspection. In 
particular, 3|a| > 1 implies that a = 0. Similarly, |b —2| = 0 and |c —1| = 1. Thus 
Eq. (4.1.1) implies that: 


a=0,b=2, andc =Oor2. 


Since there are no conditions on d or e, the possibilities for p(x), subject to (4.1.1), 
are 


p(x) = w2+dxte 
or 
p(x) = 2x7 +2x*+dx+e, 


where d and e are arbitrary integers. // 


Note that in Example 81, there are two additional unknowns (parameters) other 
than x in p(x). These parameters are d and e and both of them can have only integer 
values. Frequently polynomials have parameters (but not additional expressions 
involving x) as coefficients. Some parameters give a geometric interpretation as the 
polynomial which depends on a parameter can be thought of as a geometric family 
of polynomials. For example, in g(x) = x? + a the parameter a is the value q(0). 
In other words, a represents the point where this polynomial intercepts the y-axis 
(i.e. at (0,a)). For our solution to Example 81, e is the value of p(x) when p(x) 
intersects the y-axis. 

If we ask what the possibilities for a polynomial are or ask you to write the 
polynomial p(x) with as few parameters as possible, it means to find the coefficients 
in a way that relies on the smallest number of parameters as possible. As a reminder, 
a non-zero CONSTANT POLYNOMIAL has degree zero, a LINEAR POLYNOMIAL is 
a polynomial of degree 1, a QUADRATIC has degree 2, a CUBIC has degree 3, a 
QUARTIC has degree 4, and a QUINTIC has degree 5. 


Exercise 44. If p(x) = a(x?+2x—1)+b(x?+3)+x-—5 is a constant polynomial, 
then write the expression for p(x) in as few parameters as possible. 
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Exercise 45. Let p(x) = a(x? + x”) + b(x? +. x — 2) + x? + ax + 2 bea linear 
polynomial. Write p(x) with as few parameters as possible. What is p(x) called? 


Exercise 46. Let p(x) = box" + byx""!+---+b,_1x +b, where bo 4 0, bj € Z, 
i =0,1,2,...,2 anda + |bo| + |bi| + |bo| +--+ + |b,| = 2. Find p(x) if 


A. p(x) is a constant polynomial. 
B. p(x) is a linear polynomial. 
C. p(x) has degree n. 


In Exercise 46, we asked you to solve a question in a way which illustrates 
the important problem-solving technique: gaining insight by trying special cases. 
Note that the coefficient of Exercise 46 are integers. After studying Exercise 46A 
and 46B, which are straightforward, Exercise 46C asks for the general case, which 
is now easier to solve because you’ve done the two special cases already. If, in the 
future, you are given a problem like Exercise 46C without 46A and 46B, try a few 
special cases (n = 0 orn = 1 or more) to gain insight before attempting to do the 
general problem for any n. 

We now define addition and multiplication of polynomials in the usual way. 


Definition. Let 
Pi(X) = ayx" + An—1x" 1 ++» + a,x +a 
and 
p2(x) = Dm x™ + dmx") ++ + bx + dp. (4.1.2) 


The SUM OF THE POLYNOMIALS p(x) and p(x), is, firstly, the polynomial (p; + 
P2)(x) = pi(x) + p2(x) of degree £ where £ is equal to the larger of the degrees m 
and n; secondly, for p1(x) + p2(x), the coefficients are the same as the sum of the 
corresponding coefficients of p,(x) and p2(x). That is, 


(pi + pr) (x) = cox! + cpyx6! ++ + 1x +c (4.1.3) 


where c; = a; + 5; for each i between O and &. Therefore, the sum of two 
polynomials, p(x) and p2(x), is defined as (p) + p2)(x) = pi(x) + po(x). 


There is a technical issue with Eq. (4.1.3) which is immediately discovered. It 
may be that the degrees of p(x) and p2(x) are different, say n < m. In that case, 
there are no coefficients d)4 1, 4)+42,...@m. To make sense of the formula (4.1.3) it 
is then tacitly assumed that a,4; = dn42 = +++ = Gm = 0. Thus (p; + p2)(x) = 
Pi (x)+ p2(x) = Dinx™ Se Dns x"! ar (Gn +by)x" ae +(ay +b )x+ (ao +bo). 
Another technical point occurs when the leading coefficient of p;(x) is negative of 
the leading coefficient of p2(x) if p, and pz have the same degree. The technicality 
can also come in the product of polynomials. Note Eq. (4.1.4). 
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Definition. Given polynomials p;(x) and p2(x) of (4.1.2), the PRODUCT OF THE 
POLYNOMIALS is the polynomial of degree n + m given by 


(Pi p2)(x) =p (x) p2(x) = noe 
+ +++ + (agbo + ayb, + aob2)x? + (aybo + aobi)x + aobo. 


In other words, 


Pi(x) . P2(x) = Cine +e +C1xX + Co, 
where the 7th coefficient, c;, is given by the formula 
ci) = ajbyp + aj-1b, +++: + aobj;, i =0,...,n +m. (4.1.4) 


Thus, the product of two polynomials, p(x) and p2(x) is defined by (p, - po) 


(x) = pi(x) pa(x). 

To help to remember Eq. (4.1.4), the sum of the two subscripts on each term in 
the right hand side is always i. Furthermore, Eq. (4.1.4) is exactly the formula for 
the conventional method of multiplying polynomials from middle or high school. 


Example 82. The formal definitions of addition and multiplication of polynomials 
are described in Eqs. (4.1.3) and (4.1.4). Here are two polynomials, f(x) = 3x7 + 
4x + 1 and g(x) = 2x? — x + 4 that will be used in parts (a), (b) and (c). 


(a). To compute f(x) + g(x), with our proceeding notations, a2 = 3,a,; = 4 and 
ao = 1 whereas by = 2, b,; = —1 and bo = 4. Hence 


F(x) + g(x) = Bx? + 4x + 1) + (2x7 -— x +4) 
= (3 +2)x? + (4-1)x + (1+ 4) 
= 5x7 4+ 3x45. 
(b). Similarly, for product of f(x) and g(x): 
f(x) + g(x) = e4x* + 03x? + cox? + crx + €0, (4.1.5) 


where 


C4 = Agbo + a3b, + Arbo + a,b3 + anby = 0-44 0-(-1) + 4-04+1-0=6 
C3 = a3bo + anh, + ay bp +: anb3 = 0-44 3-(-1) + 4-441-0=5 

C2 = aby +. a,b) + apbo = 3-44 4-(-1) +1-2= 10 

Cy = abo + anh) = 4-44 1-(-1) = 15 


Co = dopbn = 1-4 = 4. 


Thus, f(x)g(x) = 6x* + 5x? + 10x? + 15x + 4. 
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(c). Alternatively, to compute f(x) - g(x) = (3x? + 4x + 1)(2x? — x + 4) we 
could multiply and collect like terms: 


(3x? + 4x + 1)(2x? — x + 4) = 3x7(2x? — x + 4) 
+ Ax(2x? =x +4) 
£10 =a+4) 
= 6x* + 5x3 + 10x? + 15x + 4. 
We could also write vertically (by expanding the product) 
3x7 ++ 4x41 


x 2x7- x+4 


12x? + 16x +4 
—3x3-—47?- x (4.1.6) 


6x* + 8x? + 2x? 


6x7 + 5x? + 10x? + 15x +4 
+ 


Note that the coefficients displayed vertically between the bars in this example 
are exactly the products in (4.1.4). For example, the coefficient of the quadratic 
term is 


C2 = Aanbp + ayby + aod) = 124+ 4(-1) +1-2=10, 


which comes from the coefficients in the column above the arrow. 
Exercise 47. Compute the following additions and multiplications. 


A. What polynomial is the sum of x? — x? + 72x —5 and x3 — 4x + 1? 

B. Add x* + 2x? — 1 and 3x? + 4x2 —x +5. 

C. What polynomial is the product (x? — x? + 72x — 5)(x? — 4x + 1)? 

D. What is (x* + 2x? — 1)(x3 + 4x? —x + 5)? 

Exercise 48. Find the coefficients of x’ and x!° respectively for the product (x° + 


2x4 — x3 + x2 — Ax + 5)(x® + 7x5 — 4x3 + 2x + 1). 


Example 83. Let p(x) = (x? —ax +b)(ax? + x —b) be given where a,b € R. If, 
for p(x), the coefficient of x is 1 and the coefficient of x is 9, find the values of a 
and b. 
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Solution. Since p(x) = (x? —ax + b)(ax? + x —b), the quadratic term is (—b — 
a + ab)x? and the linear term is (ab + b)x. So we gain the system of equations: 


—b-a+t+ab=1. (4.1.7) 
ab+b=9, (4.1.8) 
Subtracting these two equations yields 
—2b-—a =-8s0a = 8—2b. 
Substituting the last equation into (4.1.7), we obtain 
2b* — 9b + 9 = (2b — 3)(b— 3) =O, 


and so b = 3 or 3. 

There are two cases. If b = thena = 8-2. ; = 5 and if b = 3 then 
a =8-—2-3 =2. So the two solutions area = 5,b = > anda =2,b=3. // 
Example 84. If 5x? = a(x? + x + 2) + (bx + c)(x + 1), what are the possible 
values for the real numbers a, b, and c? 


Solution. Since 5x? = (a + b)x? + (a+ b+c)x + a +c), by the definition of 
equality for polynomials we obtain the system of equations 


a+b=5 
a+b+c=0 
2a+c=0. 


Solving these three equations in three unknowns yields only one solution 
5 5 
c=-5,a=-, andb=-. 
2 2 


// 
Example 85. (Version 1) If 


a(x — 2)(x — 3) + b(x — 1)(* — 2) + c(x — 3)(x — 1) = 3x —4, (4.1.9) 


then what are the real numbers a, b, and c? 


Solution. By collecting like terms we can rewrite (4.1.9) as: 


(a+b+c)x? + (—5a — 3b — 4c)x + (6a + 2b + 3c) = 3x —4. 
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Equating coefficients of the two sides of the equality gives the following three 
equations: 


at+b+c=0 
—5a — 3b—4c 
6a + 2b + 3c = —4. 


II 
Ww 


Solving the above system of equations, we obtain a = —+, b = 3, c = —2 as the 
only solution. These values satisfy Eq. (4.1.9). // 


Often there are multiple ways to solve a problem where one of which is faster 
than the others. Generally, the faster approach requires a conceptual insight into the 
mechanics of the problem. In the case of Example 85, the insight is the interpretation 
of the polynomial of (4.1.9) as a function and recognizing that when x = 1 or x = 2 
or x = 3 the arithmetic becomes much easier. 


Example 86. (Version 2) Solve Example 85 in another way. 


Solution. Since Eq. (4.1.9) is true for any x € R, consider the case where x = 
1, x = 2, or x = 3. These three values of x infiltrate a number of zeros of the 
Eq. (4.1.9) that make things much easier. 

If x = 1, then 


1 
a(—1)(—2) = 3-1—4 so that 2a = —1 ora = = Rs 
On the other hand, if x = 2, then 
c(2 —3)(2—1) = 3-2—4s0 that —c = 2 orc = -2. 


And finally if x = 3, then 
5 
b(2)(1) = 9— 4 s80 that 2b = 50rb = rt 


// 


Why did the solution fall out so simply in the second version? The problem- 
solving strategy was to think of a polynomial as a function and notice that when 
x = 1,2 or 3, one or two of the terms on the left hand side were zero. Said in other 
words, our problem-solving strategy meant finding a different interpretation (in this 
case as a function) and looking at the equation to see what values might simplify the 
arithmetic. 

It is true, of course, that any three values plugged into Eq. (4.1.9) would 
ultimately yield the result a = —}, b= 3, and c = —2. However, the strategy 
to pick numbers judiciously made the arithmetic much easier and was the most 
advantageous approach. 
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The moral (which would be wise to follow in Exercise 49 below) is to make 
sure you stop and think before starting a problem. A few minutes of thinking about 
problem-solving strategies will result in much less work and frequently, additional 
insight. 

Exercise 49. Assume that x* — 8x? + 5x? — 30x +8 =a+b(x—1)+c(x—-1) 
(x — 2) + d(x — 1)(x — 2)(x — 3) + e(x — 1)(x — 2)(x — 3)(x — 4). Find the real 
numbers a, b, c, d, and e or, if there is no solution, explain why. 


Problem Set 4.1: 


Through this problem set, a, b, c, d, and e are real numbers unless otherwise stated. 


= 


. Simplify x?(4x? — 2x — 6) — (x — 1)*(4x? + x — 2). 
2. If p(x) = a(x? —x + 1) + d(x? + 2x) + 1 is the zero polynomial, what are a 
and b? 
. Let g(x) = r(x? — x?) +. s(x? + 2x) +3x7? + tx —1 with r,s,¢ € R bea linear 
polynomial. Write g(x) with as few parameters as possible. 
4. If h(x) is a quadratic polynomial, write h(x) with the fewest parameters 
possible. 
. (a). If f(x) = ax? + (b— 1)x? + (ce +3)x — Land g(x) = 3x7 4+ (a—4)x+ 
(b—a-+c), finda, b, andc so that f(x) = g(x) (if there is a solution). 
(b). Find a, b, and c (if possible) so that the equation 3x3 — 4x? + 5x —c = 
(x? — 2x + 3)(ax + b) is valid. 
6. Find, if possible, integers a, b and c so that (a+b+c)x*+(|a|+|b|)x+e = 
. Find, if possible, integers a, b, and c such that (a + b + c)x? + |a — b|x? 
(la| + |bJx +c =0. 
. Find, if possible, real numbers a, b, and c so that 3x3 — 4x? + 6x — Cc 
(x? — 2x + 3)(ax +). 
. Find the coefficients of x? and x!! in the polynomial g(x) where 


1S) 


Nn 


x 
+ 9° 


[oe] 


II 


\o 


q(x) = (x? +2x® — 5x3 + x 7x — 33x!) 45x? -— 7x9 + 23x°) 


10. Let 3x? + 4x —5 = a(x + 1)(x— 1) + b(x— 1)(% — 2) — (x —2)(@ + 1). Find 
the value of a and b. 

11. If (x — 1)(x — 2)(x + 3) = x7 + ax? + bx +c, find the value of a+b +c. 

12. If (x — 1)(x + 2)(x — 3)(x + 4) = axt + bx? + cx? + dx +e, then 


(a) what is e? 
(b) Whatisa+b+c+d? 


13. Let 2(x+3)*—15(x+3)?+31(x+3)?—8(x+3)+8 = ax*+bx3+ex*+dxte. 
Find the values of a, b, c, d, and e. 

14. If 54x? + 27x? + 42x + 32 = a(3x +2)? +b(3x +2)? +c(3x + 2) +d, find 
the value of a + c. (Hint: Find the value of a+ b+ c+ d first.) 
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15. How would you explain where formula (4.1.4) appears in your solution to 
Exercise 47A to an Algebra I class? (Hint: Write out the solution in a form 
similar to Example 82. Look at the “scaffolding” (4.1.6) for the explanation.) 

16. Let p(x) = box” + byx""! 4+. --+ + b,x +b, where p(x) has degree n. If 


[bol + [bi] + +++ + [bn] = 0 


find all possible polynomials p(x) if 


(a). p(x) is aconstant polynomial. 
(b). p(x) is a linear polynomial. 

(c). p(x) is a quadratic polynomial. 
(d). p(x) is a polynomial of degree n. 


17. Let p(x) = (x +.a)(x + b)(x +c) and q(x) = (x —a)(x + b)(x +c) where 
a,b,c € R. Assume 


1. the coefficient of x? of p(x) is 0 
2. the coefficient of x of g(x) is 0 
3. the coefficient of x of p(x) is equal to the coefficient of x? of q(x). 


Prove thata = 1 andbc =b+c. 

18. Let (x? —5x + 4)(x? —7x +5) = axt+ bx? + cx? + dx +e. Find the value 
ofa+b+ct+dte. 

19. If the coefficient of x? in polynomial (3x? — 2x + 6)(x? + ax? + 8) is —28, 
then what is the value of a? (Answer: a = 17) 


4.2 Divisibility, Quotients and Remainders of Polynomials 


Overview Our aim for the next sections is to show how properties of the integers 
and those of the real numbers are similar to properties of the set of all polynomials 
in one variable. In this section, we see that formally, the Division Algorithm 
for Polynomials is similar to the Division Algorithm for Integers of Sect. 1.1. 
The definitions, theorems, and even the proofs in this section are very similar to 
corresponding statements in Chap. |. Our viewpoint will be strengthened in Sect. 5.3 
when we discuss the GCD and LCM for two polynomials. 

Recall the Division Algorithm in Z: If a,b € Z, with b > 0, then there exists 
unique integers g and r such that 


a=qb+r, O0<r<b. 


The algorithm by which g (the quotient) and r (the remainder) are found is just 
long division (or successive subtraction). Similarly, there is the Division Algorithm 
for Polynomials, which also can be proven by successive subtraction. Note that we 
cannot tell when one polynomial is greater than another, so instead we compare their 
degrees. 
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Theorem 15 (Division Algorithm for Polynomials). Let f(x) be a polynomial 
and g(x) be a non-zero polynomial. Then there exist unique polynomials q(x) and 
r(x) such that 


F(x) = a(x) - 8%) + r(x), 


where either r(x) = 0 or degr(x) < deg g(x). We call the polynomial q(x) the 
QUOTIENT, r(x) the REMAINDER, and g(x) the DIVISOR of f(x). We also say that 
F(x) when divided by g(x) has remainder r(x) and quotient q(x). 


Example 87. Let f(x) = 2x3 +4x?+x—2 and g(x) = 2x? +x—1 be the divisor 
of f(x). Find the quotient and remainder of f(x) when divided by g(x). 


Solution. We divide f(x) = 2x37 + 4x? +x —2 by g(x) = 2x* + x—1 using long 
division. 


3 
X+5 


2x? + x —1)2x3 4+ 4x2 + x-2 


2x? + x7-—x <— 

; (4.2.1) 

3x° +2x —2 

I. 3 3 

3x + 5x5 

I I 

7X 3 

Hence 
3 2 3 2 1 1 

2x°+4x°+x-2= x+5 (2x°+x—-1)+ a : (4.2.2) 


How do we interpret Eq. (4.2.2) in terms of the Division Algorithm for Polyno- 
mials? The Eq. (4.2.2) says that 


3 1 1 
_ @ ies 4.2. 
fis) = (x43) 20 + (5x- 5) (4.2.3) 
quotient, q(x) remainder, r(x) 


so the quotient is g(x) = x + 3 and the remainder is r(x) = $x - s. The remainder 
r(x) has degree 1, which is less than deg g(x) = 2. // 


Similarity of Division Algorithm for Polynomials and Division Algorithm for 
Integers. There is something more going on in the solution of Example 87. Looking 
at the long division, we see that (just as for integers) we are doing successive 
subtractions. For example, the line directly under f(x) = 2x? + 4x? + x —2 
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in the division (4.2.1) is 2x? + x? — x = xg(x). Following the process for the case 
of integers, we subtract the line directly under f(x) from f(x) to obtain the next 
line, 3x? + 2x — 2. We now explain why in the next paragraph. Just as in the integer 
case, the degree is lowered by 1! 

What follows in this paragraph is an intuitive explanation of why there is an 
analogy between long division of integers and the division of polynomials. We will 
use f(x) and g(x) from Example 87 and see that the first step in the division of 
f(x) by g(x) gives a polynomial (2x? + x? — x) at the arrow in Eq. (4.2.1) whose 
degree is 2 (one less than the degree of f(x)). This is similar to starting with a four 
digit number, say 2314 = 2-107 +3-10°+ 1-10+4, and getting a first step (at the 
arrow below to a three digit number) or a step which takes a “10° number” to a “107 
number”. In a numerical example, dividing 2314 by 11 is written in long division as 


210 

11) 2314 

2200 
14 

_ 110 

4 


The number 114, at the arrow represents reducing the degree by one—going from 
a 10? = 1000 term (2314) to a 10? = 100 term (114). Thus, in the first step, for 
integers in base 10, we are going from x? to x* where x = 10. (The second step 
reduces from x? to x! when x = 10.) This intuitive motivation should be helpful 
to you but note that not every division of an integer by another integer “reduces the 
power of 10 by one”. For example, divide 99 by 22 or 9 by 2. 

Long division of polynomials is exactly successive subtractions (just as for 
integers). In Example 87, we have “subtracted g(x) a total of x times” from the 
f(x) and then subtracted ; g(x) from what is left. The product xg(x) has degree 
one more than 3g (x). If the original polynomial f(x) has degree n, then there will 
be at most 7 subtractions to perform. (In fact, having (7 — 1) subtractions can only 
occur if deg g(x) = 1. Usually, there are fewer calculations.) 

We will now use the Division Algorithm for Polynomials in which there are 
different divisors of the given polynomials f(x) = ax” + bx — 4. 


Example 88. Let f(x) = ax?+bx—4 be divided by x +1 and x—1 with respective 
remainders 3 and 1. What is the remainder of f(x) when divided by x + 2? 


Solution. There are three different quotients qi (x), g2(x) and q3(x) starting with 
F(x). The statements of the first sentence of the example are given in part (a) and 
part (b). Part (c) uses the quotient q3(x) with a remainder r. 
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f(x) = ax? +bx-4= q(x) +14 _ 3 (a) 
divisor remainder 

f(x) = a(x)\@-)+1 (b) 

F(x) = qa(x)(x + 2) +1. (c) 


Then we obtain 
f(-1) = a(-1)? + b(-D) - 4 = 3, 
f(-2) = a(-2)? + b(-2) — 4 =r. 


ie. 
a—-b-—4=3, sothata—b=7, 
a+b—4=1, sothata+b=5, 
and 4a—2b-—4=r. 
Solving this system of the first two equations, we have a = 6,b = —1 andr = 
4a —2b —4 = 4(6) — 2(-—1) — 4 = 22. // 


Example 89. Let f(x) = ax? + bx +c be divided by x + 1 and 2x — 1 with 
respective remainders 1 and —2. What is the remainder when f(x) is divided by 
2x7 +x —1? 


Solution. When f(x) has a divisor (2x? + x — 1) which is also a quadratic, the 
remainder must be linear or constant, say ax + b where a,b € R. Let f(x) = 
q(x)(2x? + x — 1) +. ax + b where q(x) is the quotient polynomial. Then 

f(-) =-a+b=1 

F(1/2) = (a@/2) +b = —-2 


Solving this system we find a = —2 and b = —1. Thus the remainder is —2x — 1. // 


Example 90. Let f(x) be divided by x +2 and x?—x—2 with respective remainders 
3 and 5x + 1. What is the remainder when f(x) is divided by x? — 4? 


Solution. f(x) = q(x)(x* — 4) + ax + b where q(x) is a quotient. By evaluating 
x = —2 and x = 2, we see first that 


f(—2) =—2a +b = 3. (4.2.4) 
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With g) (x) as a quotient of x? — x — 2, f(x) = qi(x)(x? — x —2) + 5x +1, we 
have 


fQ)=2a+b=11 (4.2.5) 


Solving (4.2.4) and (4.2.5), we obtain a = 2 and b = 7. Hence the remainder is 
2x +7. // 


Example 91. What is the remainder when x + x? + x?° + x4? + x8! is divided by 
x? — x? (See Honsberger [5]) 


Solution. We present a different and very easy way to find the remainder other than 
[5]. Let x+x9 +2 +249 438! = g(x)(x3—x)+ax?+bx+c, where ax?-+bx+ce 
is the remainder we want to find. (As our divisor is of degree 3, we know that our 
remainder should be at most degree 2.) Since the roots of x* — x are x = 0 and 
x = +1, we can now find the remainder in the form of ax? + bx +c anda quotient 


q(x). 

Now we can use the roots of x?— x to isolate our unknowns a, b,c. Letting x = 0 
immediately results in 0 = c. Letting x = 1 gives 5 = a + b, and letting x = —1 
gives —5 = a—D. Solving this simple system gives us that a = 0 and b = 5. Hence 
the remainder is 5x. // 


Example 92. Let p(x) be a polynomial with the property that if p(x) is divided 
by (x — 2)* a remainder of 56x — 42 is obtained. If p(x) is divided by (2x — 1) 
a remainder of 5 is obtained. What is the remainder if p(x) is divided by (x — 2)? 
(2x — 1)? 


Solution. The minimum degree of p(x) is 3. From the first sentence we know that 
there is a polynomial q; (x) with 


P(x) = qi(x)(x — 2)” + (56x — 42). 


Using the Division Algorithm for g|(x) with divisor (2x — 1) gives a polynomial 
q(x) and a real number a such that g;(x) = g(x)(2x — 1) +a. Thus 


p(x) = (x — 2)?(2x — 1)q(x) + a(x — 2)? + 56x — 42. (4.2.6) 


We now let x = s. Then p (4) = “4 +.28—42 = 5sothata = 7 Substituting a = 
% into Eq. (4.2.6) gives a remainder of 48 (x—2)?+56x—42 = §(76x?+200x—74). 
Il 


Exercise 50. If a given polynomial p(x) is divided by x? + x + 1 a remainder of 
3 is obtained. If p(x) is divided by x + 5, a remainder of 7 is obtained. What is the 
remainder, r, when p(x) is divided by (x* + x + 1)(x + 5)? 
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Exercise 51. What is the remainder and quotient when one divides the polynomial 
p(x) = 4x4 + 2x3 — 4x? + 6x — 1 by g(x) = 2x? — x — 1? (Answer for remainder 
is 8x — 1.) 


Problem Set 4.2: 


1. Let f(x) = 4x3 + 8x? + x — 1 and g(x) = x + 2. Find the quotient and 
remainder of f(x) by g(x) by doing long division. Compare your answer to 
Example 100. 

2. Let f(x) = 9x? + 4x? — 5x — 2. Find the quotient and remainder of 
F(x) on division by 2x — 1 by using long division. Compare your answer to 
Example 101. 

3. Find the quotient and remainder if the first polynomial is divided by the second. 
Check your answer by multiplying the expressions. 


(a) x? — 2x? + 3x + 1byx?-x4+2 
(b) 407 42? = 19% = 3 by x? =3 


4. Given a polynomial p(x), if p(x) is divided by x — 1 a remainder of 8 is 
obtained. If p(x) is divided by x? + x + 1 a remainder of 7x + 16 is obtained. 
What is the remainder if p(x) is divided by (x — 1)(x7 + x + 1)? 

5. A polynomial p(x), with degree > 3 is given. If p(x) is divided by (x — 1), 
(x — 2), and (x — 3) remainders of 3, 7, and 13 are obtained, respectively. Find 
the remainder if p(x) is divided by (x — 1)(x — 2)(x — 3). 

6. The polynomial p(x) with degree > 3 has the following two properties. If p(x) 
is divided by (x — 1)(x — 2) a remainder of (20x — 6) is obtained. If p(x) 
is divided by (x — 2)(x — 3) a remainder of 32x — 30 is obtained. Find the 
remainder when p(x) is divided by (x — 1)(x — 2)(x — 3). 


4.3. The Remainder Theorem 


Overview The root of solution of a polynomial equation has a very careful 
relationship to the Remainder Theorem. Let f(x) = a,x” + ayg—jx""! +0) + 
a,x + a be a polynomial of degree n. For any real number c, we define f(c) by 
the usual formula, 


f(c) = anc" + oe a +++ +a,;C+ a. 


That is, f(c) is obtained when the constant c is substituted for x. 


Definition. If f(x) is a polynomial and c € R then c is a ROOT of f(x) if 
J (c) = 0. We also say that c is a SOLUTION of f(x) = Oif f(c) = 0. 


One of the conclusions of the Division Algorithm (Theorem 15) is that for c a 
real number, regardless of what polynomial is divided by x — c, the degree of the 
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remainder must be less than one. Thus, the remainder is a real number r rather than 
a polynomial of degree one or higher. If q(x) is the quotient of f(x) upon division 
by x — c then we have 


F(x) = a@)(e- 0) +1. 


Evaluating this equation at x = c gives the following Remainder Theorem. 


Theorem 16 (Remainder Theorem). /f the polynomial f(x) is divided by x —c, 
then the remainder is r = f(c). 


Corollary 7. If the polynomial f(x) is divided by ax — b, then the remainder is 
f (3). 


Throughout this text we have used the usual vocabulary of proof technique. 
A COROLLARY means an easy consequence of the theorem which precedes it. Thus, 
in Exercise 52 below, it is natural to use Theorem 16 to prove the Corollary 7. 


Exercise 52. Prove Corollary 7 


Example 93. If f(x) = x*+ + 5x3 + 4x? + x — 5 is divided by x — 4, find the 
remainder. 


Solution. We can compute the remainder in two different ways: directly dividing 
F(x) by x — 4 or using the Remainder Theorem. We’ll do both so that you’ll see 
what is involved in the two methods and can make a choice of method in future 
problems. 


(a) Using long division by x — 4 and, after much arithmetic, we see 
f(x) = (x3 + 9x? + 40x + 161)(x — 4) + 639. 


Thus, the remainder of f(x) on division by x — 4 is 639 and so the remainder 
is 639. 
(b) The other way to do this problem is to simply substitute x = 4 in f(x) to obtain 


F(4) = (4) + 5(4) + 4(4?) + 4-5 = 639. 
T 
Example 94, Find the remainder if f(x) = x!> + 1 is divided by x — 1. 


Solution. By the Remainder Theorem, the remainder is f(1), hence the remainder 
is f) =1541=2. H 


Remark. The point is that we did not have to use either long division or synthetic 
division (see Sect. 4.4) in this problem. 


Example 95. When f(x) = 2x? — x? + ax — 9 is divided by 2x — 3 the remainder 
is 0. What is the value of a 


4.3. The Remainder Theorem 127 


Solution. By the corollary of the Remainder Theorem, we have the remainder 
if (3) which is given to be 0. Thus, if ; is aroot of f(x), then 


0)-@) eG) 


After simplifying the equation we see that 


= cae 9 = 0or27—9+ 6a —-36=0 
a 54 = 0or a = 0. 


Hence a = 3. We have shown that if there is a value of a such that f(x) has 3 asa 


root, it must be a = 3. We check our solution by calculating f (3) = 0 fora = 3. 
// 


Example 96. Find the remainder if f(x) = x° — 4x* + x3 + 9x? + 10x — 6 is 
divided by x —1— V2. 


Solution. By the Remainder Theorem, we know that the remainder r must satisfy 
r= f(1+ 2). One way to do that is to plug x = 1+ V2 into the equation for f(x) 
but that gives a very tedious computation so we will look for another approach (a 
mathematician might say “it gets ugly fast”, although this approach will ultimately 
lead to a correct answer). Surely there must be a different way to do the problem 
that requires less computation! 

Let’s start again, using some algebraic intuition to simplify the calculation. Let 
x = 1+ V2 so that x —1 = V2. Thus (x — 1)? = 2 and x2 —- 2x41 = 2, 
so that x7 — 2x — 1 = 0. Motivated by that computation, we now use the division 
algorithm to divide f(x) by x? —2x—1 (instead of by x — (1 — V2)), the remainder 
of f(x) divided by x? — 2x — 1 will be a linear polynomial r(x) since the divisor is 
quadratic. 


x° — 4x4 + x3 + 9x7 + 10x — 6 = q(x): (x? —2x —1) + r(x) (4.3.1) 


We’ll see why this approach is useful right after (4.3.2). Finding the quotient and 
remainder can easily be done by long division. 


x2 — 2x7 -2x +3 


x? — 2x — 1)x5 — 4x4 + x3 + 9x? + 10x — 6 
2x4 x3 
= Ty" 4 9x? 4 Gy? 


—2xt 4+ 4x3 + 2x? 


= Ox? 4 7x? + 10x 
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—2x? + 4x? + 2x 


3x? + 8x —6 
3x" — 6x —3 
14x —3 


Hence for f(x) divided by x7 — 2x — 1, the quotient is g(x) = x? — 2x? —2x +3 
and the remainder is r(x) = 14x — 3. This means that 


f(x) = 0° — 4x4 + x3 + 9x? + 10x — 6 
= (x? — 2x? — 2x + 3) (x? — 2x —1)+ (14x -3) 
_—_—_—_—_—_—_—_—_—__ eS —-_— 


q(x) divisor r(x) 
or writing g(x) = x3 — 2x? — 2x +3, 
f(x) = q(x) (Qc? — 2x — 1) + (14x — 3). (4.3.2) 


The key here is that we have found a form of Eq. (4.3.1) which is very easy to 
evaluate when x = 1 + V2. The point is that q(1 + V2) need not be computed. 
Rather we notice that q(x) is multiplied by (x2 — 2x — 1) which has x = 1+ /2 as 
a root of f(x) and hence, using Eq. (4.3.2), its product with q(1 + 2) is also zero. 
To finish the example, 


f(l+ V2) = q+ V2)04+ 141 + V2) —3 = 114 14V2. (4.3.3) 


// 


The solution to Example 96 is a case of problem-solving by infiltrating zeros. 
The approach is to involve the division algorithm in a way that greatly simplifies the 
arithmetic through multiplication by 0 (see Eq. (4.3.3)). Some thought shows that a 
linear term will not help much but “getting rid of the square root” might (and does). 
That approach requires squaring and so we end up using a quadratic polynomial 
x* — 2x — 1 as a divisor of f(x). The key, of course, is that x* — 2x — 1 is zero 
when x = 1 + V2. An approach which seems to look like it will make things more 
complicated (going to a quadratic instead of using a linear polynomial) in fact makes 
the problem easier! 


Remark. Since 1 — V2 is also a root of f(x) = x? — 2x — 1 = 0, we have f(1 — 
V2) = q(x)-0+ 14-(1— V2) -3 = 11 14v2. 
Exercise 53. Find the remainder if x* — 3x? — 2x? + 1 is divided by (x — =livs i 


(Answer: ils ) 
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A final remark We can use the remainder theorem to give another proof of tests 
for divisibility (see Sect. 1.1). 


Proof. Let f(x) = ayx" + Gy—1x"! +---+ a,x + ao be a polynomial of degree n 
with coefficients in Z and 0 < a; < 9(i = 0,1,--- ,”), a, 4 0. If f(x) is divided 
by x — 1, then the remainder theorem gives 


AnX" +++» + a,x + do = q(x)(x — 1) + (Gy + Gn-1 +++) + 0). 
Let x = 10, we have 
Gy 10" + dy—10"~! +--+ + a,10 + ay = q(10)- 9+ (a, +--+ +a) +a). 


Thus, we have proved the divisibility test for 3 and 9. a 


Theorem 17 (Divisibility Test 3/9). Let m be represented asm = dy,dn—\+++a2 
a,dao. Then we have the following results: 

1, 3 divides m if and only if 3 divides ay + Gn—1 +++ + 42 +1 + ao. 

2. 9 divides m if and only if 9 divides ay + An—1 +++ +d. +a, + a. 

Exercise 54. By using the remainder theorem, prove the divisibility test for 11. 
(Hint: Consider a,x” + dy—1x"—! +++» + a,x + ao is divided by x + 1.) 


Problem Set 4.3: 

1. If x3 + 3x? — x — 6 is divided by 2x — 5, find the remainder by infiltrating 
Zeros. 

2. If (x? + ax* + 3x — 4) is divided by (x + 1) with remainder 2, find the value 
of a. 

3. If (x? + ax? + 5x + 1) is divided by (2x — 1) with remainder 4, find the value 
of a. 

4. If (x? +. ax? — 6x + 2) is divided by (3x + 2) with remainder 10, find the value 
of a. 


5. Let p(x) = x3 +. ax? +bx —2. If p(x), when divided by x + 1, has remainder 
—1 and p(x), when divided by x — 1, has remainder 3, find the real numbers a 
and b. 

6. Let p(x) = x3 +.ax? +bx—1.If p(x), when divided by x + 2, has remainder 
—11 and p(x), when divided by x — 2, has remainder 17, find the real numbers 
a and b. 

7. Let p(x) = x3 +ax? + bx —6. If p(x), when divided by x — 2, has remainder 
2 and p(x), when divided by x — 3, has remainder 51, find the real numbers a 
and b. 

. Find the remainder if p(x) = x® + 5x5 — 10x3 + 4x — 1 is divided by x — V3. 

9. Find the remainder of 


Co 


f(x) =x? + 4x4 -3x 42 
if it is divided by x + 2— V3. 
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10. Find the remainder of 
g(x) = x? =x? 43x -12 


if it is divided by x —1— V5. 

11. If x° + 3x? — x — 6 is divided by x — (./5 — 1), find the remainder. 

12. If 4x4—8x3—15x?+ 13x + 1 is divided by 2x — (3 + 2/2), find the remainder. 

13. If 9x4 —54x3 + 8x? + 8x — 1 is divided by x — (5 — 32), find the remainder. 

14. If x4 —22x? — 48x + 2 is divided by x — (/2 + V3 — V6), find the remainder. 

15. If x44 11x7+21x +4 4 is divided by x —(V/3 + /5+ V10), find the remainder. 

16. Find the remainder of p(x) = x° + 4x° — x? + 1 on division by x — 1+ /2 
by infiltrating zeros. 

17. If (x + 3)" + 1 is divided by x + 1, find the remainder (as a function of 7). 

18. Suppose that p(x) = ax? + bx + is a quadratic. If the remainder of p(x) on 
division by x —7 is 3 and the remainder of p(x) on division by x — 2 is 4, show 
that a and b are relatively prime. (Hint: Use Proposition 5.) 

19. Suppose that neither a or b is zero andr # s. If the remainder of g(x) = 
ax* + bx + ¢ on division by x — r is d, and the remainder of g(x) on division 
by x — s is dy, show that GCD(a, b) divides d, — dp. 

20. Let f(x) = x? -—ax +b. If f(2) = 6, f(—1) = 3, then what are real numbers 
a and b? (Ans: a = 0, b = 2.) 

21. Let f(x) = x++ px*+q(x) +a’. If f(x) have two roots 1 and —1, then f(x) 
has two other roots a and —a. Prove this result. 


4.4 Synthetic Division 


Overview This section is not necessary nor dependent on other sections, but it is 
quite interesting. Some calculation and notation can be saved by omitting powers of 
x in the long division of polynomials, although care must be taken to keep track of 
where each of the coefficients came from. This observation leads to the notational 
shortcut called Horner’s scheme or Synthetic Division and involves division of 
any polynomial by another polynomial; however we will emphasize division by 
linear or quadratic polynomials. The method is taught in high school in the US 
under the name Synthetic Division and in other countries, such as Taiwan, at a 
similar grade level but usually called Horner’s Scheme. We will use these phrases 
Synthetic Division and Horner’s Scheme interchangeably; Horner’s scheme is due to 
W.G. Horner (1786-1837) in about 1819. 


Example 97. Let f(x) = 2x3 + 4x? + x —2 and the devisor g(x) = 2x7 +x-1. 
Complete the polynomial division while omitting all powers of x. 


Solution. Coordinating the polynomials f(x) and g(x) without any x terms is 
easily done in Table 4.1 below. 
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Table 4.1 The functions of As polynomial As coefficients 
Example 87 3 2 

f(x) =2x3 +402 4+x%-2 2 +4 41 -2 
g(x) = 2x7 +x-1 2 +1 -1 
Table 4.2 Equation (4.2.1) {<2 
as coefficients oe 
2+1-1)2+4+4+1-2 

2+1-1 
3+2-2 
a+ 3-3 
1_d 
2. 2 


Using only these coefficients, the polynomial division in Eq. (4.2.1) is repre- 
sented by Table 4.2. Here the quotient appearing as 1 + ; represents the polynomial 
q(x) = Ix+ 3 and the remainder appearing as 5 = 5 represents the polynomial 
r(x) = $x - 5. As expected, these quotient and remainder terms match those from 
Eq. (4.2.3). 

I 


Example 98. Let h(x) = 2x3 + x + 1 and g(x) = x? + x. Find the quotient and 
remainder of h(x) by g(x) using only the coefficients in the polynomial division. 


Solution. A word of caution is first in order as you construct a table similar to 
Tables 4.1 or 4.2. Don’t forget to include the zero coefficients! Since, really, h(x) = 
2x3 +x+ 1 and g(x) = x? +x, the table for this problem similar to what was done 
for Example 87 is: 


2-2 


110/201 1 


2.2 0 
—-2 1 1 
202.0 
-1 1 
Thus the quotient is g(x) = 2x — 2 and the remainder is r(x) = —x + 1. You must 
make sure not to forget to put in all the terms (even if a coefficient is zero) when 
constructing a table. // 


Exercise 55. Use only the coefficients in long division to find the quotient g(x) and 
the remainder r(x) when the first polynomial is divided by the second. Before you 
start, decide what the degree of the quotient will be. 
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(a) 3x3 +4x7+ 1 byx?7+x+1 
(b) 5x° + 4x34 2x +x+2byx—3 


In the event that the divisor is a monic linear polynomial x — d, then the divisor 
in Table 4.2 reduces to the form 1 — d. At each stage of the coefficient division, the 
initial term of the product of the divisor with the quotient will mirror the quotient. 
In the next example, compare the initial term at each stage with the quotient term 
used to create it. 


Example 99. Divide f(x) = 2x3 + 4x? + x — 2 by the monic linear polynomial 
h(x) =x -2. 


Solution. Since the divisor is monic, we write it as 1 — 2 in the table. 
Thus the quotient polynomial is the quadratic polynomial q(x) = 2x? +8x +17 
and the remainder is the constant polynomial r(x) = 32. // 


Notice that at each stage, we multiply the next term of the quotient by 1 and 
then subtract the product from itself resulting in a zero in that column of the table. 
Eliminating the successive vertical displacement for each product, one can choose 
to write all products on a single line. Typically, one omits the coefficient 1 of the 
x-term in the divisor and changes the sign of the constant term. In this scenario, 
Table 4.3 becomes: 


24 1-2 
2) 416 34 
2817 32 


One thus reads the coefficients of the quotient polynomial from the third row 
remembering that the final term is the remainder. This process is known as Synthetic 
Division or Horner’s Scheme but before a formal definition, let us see why the 
process works. 


Proposition 15 (Linear Divisor Case). Jf f(x) is the polynomial of degree n 
given by 


f (%) = ayx” + ay—yx" | + ayox"* +--+ + ax + a9 (4.4.2) 
Table 4.3 Division by a 2+ 8 +17 +32 
monic linear term as 
coefficients 1-—2)2 +4+4+1- 2 
2—- 4 
8 + 1- 2 
8 — 16 
17 -— 2 
17 —34 


32 
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and 
g(x) =x-c 
where do,...,@y, and c are real numbers, then 
F(X) = (bp a1x" | + Dy nx"* Fo + box? + dix + bo) g(x) +r (4.4.3) 
where 
by—| = An and by = by4ic + agg if 1 <k <n—-2 (4.4.4) 
and 
r=boc + ap. 
Proof. Since g(x) = x —c, the right hand side of (4.4.3) is equal to 
by—\x" + by ox"! free Ht box? + b,x? + box 
—cb,—1x""! = chy_ax"? — +++ — cbox* — chix — boc +r 
= by—1x" = (bn—2 _ eh )x = (bn—3 _ Chg-3)x” Sfesteeest= 
(by — cb3)x? + (by — chy) x? + (bp — cb1)x + (r — boc). 
Comparing this last equation with (4.4.2) term by term, we start with x” and 
conclude that b,_; = a,. The next term (the coefficient of x”~!) is (b,-2 — cby—1) 
OF dy—) = by—2 — Chy—1, which is ay4, = by — chy4, for k = n — 2. The other 


coefficients follow the same rule as given in (4.4.4) by looking at the formula, term 
by term. a 


Definition. Given polynomials f(x) = a,x" +4y)—1x"—! +ay—2x"-? +++ +ayxt+ 
do and g(x) = x —c, the shortcut division of f(x) by g(x) given by the table: 
is called Synthetic Division (also known as Horner’s Scheme). 


The word “sum” in Table 4.4 below means that b,,_; is the sum of the two terms 
above it or: 


bn i= bn i+1€ + Qn—-i+1 and r = ay + boc. (4.4.5) 


Table 4.4 Synthetic division (Horner’s scheme) by a monic linear polynomial 
Coefficient An An-1 An—2 see ay do 
Times Cc 0 bn-1+C Dna ts Dye oboe 


Sum an = bn-1 bn—2 bn—3 bo r 
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Note that, first of all, the above equation is exactly (4.4.4) with k = n—i. Secondly, 
we are used to starting at the beginning (which is where n = | or 0) of a sequence 
and working up in these kinds of algorithms. This method does the opposite. It 
starts with the known coefficients of f(x), do,a1,:*+ ,@, and the constant c of 
g(x) and first finds b,_; (it is a,). Then, it computes using (4.4.5), the next “lower” 
b-coefficient, b,-, as by—1 = byc + dy. Having found b,—1, this gives b,—2 using 
(4.4.5) again. This kind of procedure can be called a “descending computation”. The 
term r is the remainder and g(x) = by—,x"~! + Dy—gx"? fee + box? + bix + bo 
is the quotient of f(x) by x — c. The “sum” means that, as in (4.4.5), 


bn i =by it1€ + Gn-i+1, 


(as in (4.4.5)) which is just (for each i between 0 and 7) the sum of the two numbers 
above b,_;. It is harder to write the formulas than to do the synthetic division! 


Example 100. Use Horner’s scheme to find the quotient g(x) and the remainder 
r(x) if f(x) = 4x3 + 8x? + x — 1 is divided by g(x) = x + 2. 


Solution. Since c = —2, the table created by Horner’s Scheme is below. 
4 81-1 
—2} —80 -2 
4 01-3 
Hence, q(x) = 4x? + 1, and r is the constant polynomial r(x) = —3. // 


Exercise 56. Do each of the divisions below twice, once using long division and 
the second time with Synthetic Division. You should be able to read the similarities 
of the two methods in your work. 


(a) 5x° + 4x? +2x+x+2byx-3 
(b) x*—8x?4+ 16byx4+2 
Synthetic division can also be used if the divisor is ax — b, rather than the monic 


polynomial x—c. If the polynomial f(x) is divided by the general linear polynomial 
ax — b then 


f(x) = q(x)(ax —b) +r 


or 


b 
F(x) = aq(x) (: 7 ?) eo (4.4.6) 
Thus if one wishes to apply Synthetic division to divide a polynomial by a general 
linear polynomial ax — b, the operation is handled by first dividing by the monic 
x B as shown in Example 101 and using Eq. (4.4.6). 


Example 101. Use Horner’s scheme to find the quotient g(x) and the remainder 
r(x) if f(x) = 9x3 + 4x? — 5x — 2 is divided by 2x — 1. 
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Solution. From the discussion of the last paragraph, we first divide f(x) by x — 5 


94=5 =2 

using the table below. 3} 3 # 2 
gis 

a an: 


This last table means, of course, that f(x) divided by x 


Ox? + Ux = ; with remainder J8 or 


f(x) = (9 + oe - 7 (: - 5) + (-F) ; 


However, we are looking for the divisor 2x — 1, not the divisor x — 5. We now just 
multiply and divide the first term on the left hand side by 2 to obtain the solution, as 
we did in the general case of Eq. (4.4.6): 


fy = 5 (ox + Ze D)a(x-3)- 2 


Thus, the quotient of f(x) by 2x — 1 is g(x) = 3x? + x - : and the remainder 


: 9 
is—2. // 


Proposition 16 (Synthetic Division for Linear Polynomial). Jf f(x) = a,x” + 
Ayn —1X"—! + +++ + a,x + do is divided by the linear polynomial g(x) then 


A. (Monic Case) If g(x) = x —c where c € R then the quotient of f(x) by g(x) 
is q(x) = by—yx""! +--++ bo and the remainder is r = dy + boc, where 
bo,-++ , by—1 are defined by Eq. (4.4.5). 

B. (General Case) If g(x) = ax — b for a,b € R then the quotient of f(x) by 
g(x) and the remainder r are obtained by first using Part A to find the quotient, 
written qo(x), and the remainder, ro, of f(x) by x — 5, Then q(x) = 1 40(x) 
andr = ro. 


Exercise 57. Use Synthetic Division to find the quotient and remainder, g(x) and 
r(x), of the first polynomial divided by the second. 


(a) Ax* + x3 + 3x? —4x +1, by 4x —1 
(b) 3x4 + x?—4x +5, by 5x +1 


Horner’s scheme can also be extended to a more general case in which the divisor 
is a quadratic polynomial. If f(x) = a,x" + dy,—1x"~! + -+++ ao is divided by the 
monic quadratic x? —bx—c, we may write a table, similar to Table 4.4 in the linear 
divisor case, for polynomials of any degree divided by a monic quadratic. Before 
we give that result (Table 4.5), we will concentrate on the case in which f(x) is a 
quartic polynomial to simplify the notation. All of the conceptual issues for arbitrary 
polynomials are already present if f(x) has degree four. 
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Proposition 17 (Monic Quadratic Divisor Case). If f(x) is a quartic polynomial 
given by 


f(x) = aax* + a,x? + ax? +a)x +a 
and g(x) is the monic quadratic 
g(x) =x? —bx-—c 
where ao, a1, A, 43, a4, b, & c € Rthen 
f(x) = (box? + bx + bo) g(x) + r(x) (4.4.7) 


where r(x) = rx + ro and 


by = a4 
by = a3+ bby 
bp = an + bb, + chy (4.4.8) 


rn =a,t+bbot+ch 

To = ao + cho. 
Thus the quotient of f(x) by g(x) is q(x) = byx? + b,x + bo and the remainder is 
r(x) =ro trix. 


Proof. Following the same steps as in Proposition 15 (i.e. start by expanding the 
right hand side of (4.4.7)) we obtain, 


f(x) = box* + (b) — bbz) x? + (bo — bb; —cbh2)x? + (71 — bby — cb) x + (ro — cb). 


(4.4.9) 
Equating coefficients of f(x) to those on the right hand of (4.4.9) yields the result, 
Eq. (4.4.8). a 


What would be the table similar to Table 4.4 of the linear divisor case for a quartic 
divided by a monic quadratic? Proposition 17 gives Table 4.5. 


Table 4.5 Synthetic division of quartic by monic quadratic 
Coefficient a4 a3 a2 a, do 
Times b| 0 b-boy b-b, | b-bo 0 
Times c | 0 0 c:by | c-by c-bo 
Sum by by bo ry ro 


4.4 Synthetic Division 137 


Table 4.6 Synthetic division of general polynomial by monic quadratic 


Coefficient An An-1 An—2 An-3 ts AD a, do 
Times b|\0 b-by-2 b-by-3 D+ by-g +++ bb, | b- bg O 
Times c|0 0 C+ Dyno C+bp-3 +++ Cb c-b, c-bo 
Sum Qn = by—-2 bn-3 bn—a bn—s s+» do r| To 


Once again, “sum” means that the entry is the sum of the entries above it. Note 
how each element of the second row gets multiplied by b and those of the third 
row get multiplied by c. Also notice how the indices of the a; goes down by one 
across the top, how that of the b; goes down across the second row (after being 
multiplied by 5) and that of the b; goes down by one in the last row (after being 
multiplied by c). Also, each of the rows “starts” one entry over as we move down 
rows. Furthermore the symmetry of the table can be thought of as a reason why there 
is a zero at the end of the second row. After all, there is no b_, (which would be the 
next coefficient to appear if that makes sense) so putting zero keeps the symmetry. 
Of course, the last sentence is not a mathematical statement—the mathematics is 
contained in Proposition 17. 

To divide a polynomial of degree n by a monic quadratic, the table will simply 
need to have more columns to compensate for the additional terms. Once again, 
r1X +1o is the remainder, and g(x) = b,—-2.x"~? +--+ bx + bo is the quotient of 
f(x) = ayx" + +++ + do by g(x) = x? — bx —c (Table 4.6). 


Example 102. Find the quotient g(x) and the remainder r(x) if f(x) = x*—2x3+ 
x? — 5x + 4 is divided by g(x) = x” — x + 1 by using Horner’s scheme. 


Solution. Table 4.5 for this case is: 


1 -—2 1|-5 4 

1 | 0 1 -1}-1 0O 
-1/0 0 -l 1 1 
1 -1l -1}-5 5 


Hence q(x) = x* — x — 1, r(x) = —5x +5. We can always check with long 
division (or by multiplying g(x)g(x) and adding r(x)). 
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a ae | 


x? —x + 1)x4—2x3 + x? -—5x +4 


xt xe + x? 


ei —5x 

—7° 4x" = % 
—x° —4x+4 
a 
—5x+5 


// 


Exercise 58. Use Horner’s scheme (Synthetic Division) to find the quotient and 
remainder if the first polynomial is divided by the second. Check by multiplying the 
quotient by the divisor and adding the remainder. 


A. x4 +2x3 — x? +5x —2 by x? +2x—-1. 
B. xt + 4x3 — 2x? — 12x — 3 by x? — 3. 
C. 3x4 + 4x? -—5 by x? +x. 


One underlying assumption of Proposition 17 is that the quadratic is a monic 
polynomial. Similar to the case when dividing by a general linear polynomial, when 
dividing by a general quadratic polynomial g(x) = ax* + bx +c, one can factor 
out the lead coefficient and use as a divisor x* + by + © instead before dividing 
the resulting quotient by the coefficient a. The following proposition formalizes this 
process for division by a non-monic quadratic polynomial. 


Proposition 18. Let f(x) be a polynomial and g(x) = ax” —bx—c be a quadratic. 
Then 


A. the remainder of F) on einen by g(x) is the same as the remainder of f(x) 
on division by x” — bx —«5 


B. the ae of f(x) on Gen by g(x) is — | times the quotient of f(x) divided 


b ¢ 


Proof. Let q(x) be the remainder of f(x) divided by g(x) = ax* — bx — cc and 


r(x) = dx +e the remainder. If the polynomial f(x) is divided by the polynomial 


ax? — bx —c, then we have 


II 


F(x) q(x) (ax? —bx—c)+ (dx +e) 


= aq(x)(x? —2x-5)+dx+e. (4.4.10) 
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Therefore q(x), the quotient of f(x) by g(x), is given by dividing the quotient of 
J (x) when by x?+ bx — © by a; in either case the remainders are the same whether 
b c 

x—< o 


we divide by g(x) or x7 — 2x —£. 
Example 103. Using Horner’s Method, if 4x* + 2x3 — 4x? + 6x — 1 is divided 
by g(x) = 2x? — x — 1, then what are the remainder and quotient? Compare to 
Exercise 51 of Section 4.2. 


Solution. We apply Horner’s scheme again with the divisor h(x) = x2 — 4x —4 


2 2? 
instead of g(x). Of course, h(x) = g(x)/2. 
42-4/6 -1 
s| 2 2/0 
f 4.4.11 
+] 2 |2 0 
44 0 8-1 


The remainder of 4x+ + 2x3 — 4x? + 6x — 1 on division by g(x) or A(x) is thus 
8x — 1 from Proposition 18A. Using part B of that same proposition, the quotient on 
division by h(x) is 4 of 4x? + 2x or g(x) = 2x? + 2x, thanks again to Eq. (4.4.10). 
The problem can be done directly without use of Synthetic Division as Eq. (4.4.12) 
shows. 


2x? + 2x 


Ix? = x — 1) 4x4 + 2x9 — 4x2 + 6x = 1 


—4x4 4+ 2x3 + 2x? 


(4.4.12) 
Ax? —2x* + 6x 


—4x3 + ax? +2x 


8x — 1 
Proposition 18A shows that 8x — 1 is also the remainder of f(x) on division by 
g(x) (as well as by h(x)). // 


Exercise 59. In Example 103, why does the table show 4 4 0 8 —1 which would 
seem to imply that the quotient is 4x? + 4x whereas the computation (4.4.12) shows 
the quotient to be 2x” + 2x? 


The Synthetic Division can be used to write a given polynomial as the polynomial 
inax +b. 


Example 104. Let f(x) = x4—3x3+x?+x+19. Express f(x) as the polynomial 
in x — 2, ie. find a, b,c, d,e such that 


f(x) = a(x —2)4 + b(« — 2)? + c(x — 2)? + d(x —2) +. 
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Solution. Using the Synthetic Division, we have 


tas Jiu 49 
DD 5 
ce ee 
ON. + ore 
ae ee) 
7 2 6 
130 
B29 
ORO) 


Thus f(x) = (x — 2)4 + 5(x — 2)? + 7(x — 2)? + (x —2) +17. 


Notice that these values are the same coefficients of the Taylor series about 2. 


f(x) = x4 3x3 4+ x7 +x4195 f(2) = 16—-3(8)+442+4+19=17 
f'(x) = 4x3 — 9x? 42x +15 f’2) = 4(8)-9(4) 422) 4+1=1 
f(x) = 12x? — 18x +2 > f"(2) = 12(4)— 18022) +2 = 14 

ff" (x) = 24x —18 > f"(2) = 24(2) — 18 = 30 
FOR) = 245 FOR) = 24 


By Taylor’s Theorem, we know that 


FPR) cn He 4 a 14 
ag Ti (x= 2)" = a + &- Dt FO -B? 
24 
a 


= 17+ (x—2)4+ 7(x — 2) + 5(@ — 2)? + (x — 2)'. 


+ ade 29) 4 = 2) 


/ 


Exercise 60. Let f(x) = 4x? + 5x? — 7x + 4. Express f(x) as a polynomial in 
(x — 3). 


4.4 
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Problem Set 4.4: 


. Use only the coefficients in long division to find the quotient g(x) and the 


remainder r(x) when the first polynomial is divided by the second. Before you 
start, decide what the degree of the quotient will be. 

(a) 5x4 — 3x3 —x? +x —S5 by 2x*+x+1 

(b) 2x° + 3x4-+ x —5 by 2x2? -—-x +1 


. Use Horner’s scheme to complete the following polynomial divisions by a 


monic linear polynomial. 
(a) 3x? —2x?— 11x + 12 by x —4. 


(b) x* + 2x3 —x? ++x+6byx +2. 
(c) x° — 3x4 + x3 — 5x? + x -—T by x —2. 


. Use Synthetic division to complete the following polynomial divisions by a 


general linear polynomial. 
(a) 3x7 —2x +x—1by3x—-1. 


(b) 2x° — 6x* + 6x3 — 4x? + 2x —7 by 2x +1. 
(c) 8x7 — 24x —1 by 4x —1. 


. Use Synthetic division to complete the following polynomial divisions by a 


monic quadratic. 

(a) x°+x%-—5x4+3byx*-1 

(b) x* + 5x3 + 20x? + 80x + 64 by x? + 5x +4 
(c) x4 — 8x3 + 15x? + 8x — 16 by x7 + 2x41 


. Use Horner’s Scheme to find the quotient and remainder of f(x) divided by 


g(x) if 

(a) f(x) = x4 4+ 4x3 — 2x? — 12x —3 by g(x) = —x? —3 

(b) f(x) = 6x4 + 2x3 — 3x? — x by g(x) = 2x?-1 

(c) What would happen if you did Part A or B using long division instead of 
Horner’s Scheme. (Hint: This is a trick question!) 


. (Synthetic Division of a cubic by a monic quadratic) Let a3, a2, a, b and c be 


real numbers and let f(x) = a3x? + a)x* +.a,x +dy and g(x) = x? —bx—-c. 
If g(x) = byx? + b,x + bo and r(x) = r,x + ro for fixed real numbers bo, by, 
b>, ro, Fr; are such that q(x) is the quotient of f(x) by g(x) is the remainder. 
What is the formula for b2, b;, bo, r;, and rp in terms of the other coefficients 
similar to (4.4.8). (Hint: Mimic Proposition 17 recognizing that in this exercise 
F(x) is cubic, not quartic.) 
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7. By using the Synthetic Division, find a,b,c, d ande. 


(a) 4x7 — 3x +5 = a(2x — 1)? + b(2x — 1)? + c(2x — 1) +d. (Answer: 
a=1/2,b =3/2,c =0,d = 4) 

(b) x4-3x3 +27 +2x4+19 = a(x—2)*+b(x—2)? +.e(x -2) +d(x—2 +e. 
(Answer:a = 1,b =5,c =7,d =1,e = 17) 


8. Do #7 by using Taylor’s Theorem 


Chapter 5 
Factoring Polynomials, Their Roots, 
and Some Applications 


Introduction This chapter focuses on the relationship between polynomials with 
real coefficients, roots which may be complex numbers or rational roots, and GCD 
and LCM for polynomials. 


5.1 Factoring Polynomials and Their Roots 


Overview In Chap.4, we discussed divisibility of polynomials and the resulting 
quotient and remainder. Factoring polynomials means looking for quotients where 
the remainder is zero. Thus, the factoring of polynomials is intimately connected 
with the roots of polynomials. This relationship is explored extensively in middle 
and high school algebra to solve problems involving the roots of polynomials, but 
many students do not realize the conceptual basis for the connection between finding 
roots and factoring. 

In this section, we will see how easy it is to show the connection between 
factoring and finding the roots of a polynomial and then present a variety of 
problems involving the relationship between a polynomial and its roots. We will 
be using Greek letters (a, 6, y,...) for the roots of polynomials. There is no 
assumption that these real numbers are integers or rational unless explicitly stated 
(as in Sect. 5.2). 

The Remainder Theorem (Theorem 16 of Sect.4.3) has an important conse- 
quence which is a corollary showing the connection between factoring a polynomial 
and its roots (or zeros). The corollary is so important that we describe it as the Factor 
or Root Theorem. That relationship was hinted at in Example 96 of Sect. 4.3. 

We will first write the definition of “factor”, which mimics the definition for 
integers. Recall that for integers a and b, if there is an integer c with b = ca, then 
a is called a factor of b. 
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Definition. A polynomial g(x) is a FACTOR of the polynomial f(x) if there is a 
polynomial g(x) such that f(x) = q(x)g(x). In this case, we also say that g(x) 
DIVIDES f(x) and write g(x)| f(x). 


The definition above is equivalent to saying g(x) is a factor of f(x) exactly when 
there is a polynomial g(x) such that f(x) = q(x)g(x). In this form the equation for 
factoring looks very similar to the equation we gain from factoring integers. We say 
that g(x) DIVIDES f(x) if there is a polynomial g(x) such that f(x) = q(x)g(x). 


Theorem 18 (Factor or Root Theorem). Let f(x) be a polynomial and c € R. 
Then f(x) has the linear polynomial x — c as a factor if and only if c is a zero of 


Ff (x), that is f(c) = 0. 


Proof. By the Remainder Theorem, we have 


F(x) = a(x)(« — 0) + fC). 


Hence (x — c) is a factor of f(x) if and only if f(c) = 0. i 


Example 105. If f(x) = 3x3 — 9x? + kx — 12 has a factor x — 3, then what is the 
value of k? Find two ways to solve this example. 


Solution. 


First Solution. By the Factor Theorem, we must have f(3) = 0. Thus 3 - 3° — 9- 
3? + 3-k —12 =Oor81—81 4+ 3k — 12 = 0. We conclude that k = 4. 
Second Solution. Another way to solve this problem is to divide f(x) by x — 3 to 

obtain 


f(x) = 8x? +k)(x — 3) + Bk — 12). 


For x — 3 to be a factor of f(x), the remainder of f(x) on division by (x — 3) 
must be zero. Hence 3k — 12 = 0, which means that k = 4 as in the previous 
solution. 


// 


Example 106. Let f(x) = 345x° —655x*— 50x? + 9x + 1. Use the Root Theorem 
to determine whether x + 1 is a factor of f(x). 


Solution. Since 
f(-1) = 345(—1)° — 655(—1)* — 50(—1)? + 9(-1) + 1 = —1058 4 0, 


it follows from the Root Theorem that x + 1 is not a factor of f(x). // 
Exercise 61. 


(a) If f(x) = x*—3ax? + bx + 4has (x + 1) and (x — 2) as two factors, find a 
and b. 
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(b) Knowing the solution to Part(a), are there other linear factors of f(x) with real 
coefficients besides (x + 1) and (x — 2)? 


Theorem 19. If ax? + bx +c is a quadratic equation with real coefficients, then it 
has real roots if and only if b? — 4ac > 0. Furthermore, if « and B are those roots, 
then 


a+ B =~" andap = © (5.1.1) 


Proof. For the first sentence of the Theorem, if @ and 6 are the roots then (x — a) 
and (x — B) are factors of f(x) = ax? + bx + ¢ and so (x — a)(x — B) = 
x? — (a + B)x 4+ af also is a factor of f(x). Comparing quadratic terms, 


a(x — a)(x — B) = a(x? — (a + B)x + aB) = ax? +bx+e 
and Eq. (5.1.1) are immediate. 
For the second sentence, we will use the standard “complete the square” approach 


and then derive the quadratic equation as a corollary. The roots a and 6 of the given 
equation occur exactly when 


b 
Pe geo S6: (5.1.2) 
a a 
Equation (5.1.2) has the same solutions by adding and subtracting (2) 
rh on ae ae, 
r _ _ 
2a 4a a 


b\? be c b*—4ac 
= —-= . 5.1.3 
(: ss =) 4a> a 4a? ( ) 


Simplifying the previous equation, this proof shows that Eq. (5.1.3) has a real root 
if and only if its right hand side is non-negative, which happens exactly when b? — 
4ac > 0. =i} 


Note: The quadratic ax? + bx +c has complex roots if and only if b*—4ac < 0. 

Implicit in this proof is the quadratic formula from high school which we state 
separately. The proof follows immediately from Eq. (5.1.3) by taking the square of 
both sides of the equation. 


Corollary 8. The quadratic equation with real coefficients, a, b and c 


ax? +bx+c=0 
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has roots 


oe —b + Vb? — 4ac 
~ 2a : 


Because of the previous corollary, the Proposition 19 is immediate. 
Proposition 19 (Rational Roots for Quadratic Equations). The quadratic 
equation 


ax? +bx+c=0 


has rational roots if and only if b? — 4ac is a perfect square. 


Using our newly developed language from Chap. 5, if b? — 4ac > 0 then the 
roots a and # are real and 


—b + Vb? — 4ac 
——_—_—— an 
2a 


—b — Vb? — 4ac 


dsp= 
Bp 2a 


The quantity b? — 4ac comes up so often in algebra it is given a special name. 


Definition. If f(x) = ax* + bx +c is a quadratic, then the DISCRIMINANT of f, 
A(f), is given by 


A(f) = b? —4ac. 
It is usual to omit f and write A if there is only one polynomial involved. 
Once complex numbers are discussed, the results above are interpreted as 


A(f) = O means that the roots of f are real and A(f) < O means that they 
are complex (with a non-zero imaginary part). 


Example 107. Let a, B be the roots of x?+4x—9 = 0. Find the values of a?8 +a 
and a? + B? without finding a and B. 


Solution. Because of Eq. (5.1.1), + B = —4 and aB = —9. Thus, 
a’? B + ab? = aB(a + B) = (—9)(—4) = 36. 
Similarly, 
a + BP =(@+ B) — 3eB(@ + B) = (-4)° —3(-9)(-4) = -172. I 


Exercise 62. Let a, B be the roots of x” + 6x — 120 = 0. Find the values of 7 + 
and a? + §? without finding a and £. 


« 
B 


Example 108. Let a, B be the roots of 3x? + 6x +9 = 0. Find a quadratic equation 


whose two roots are B and zB 
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Solution. Since a and f are the roots of 3x” + 6x + 9 = 0, we havea + B = —2 
and af = 3. The quadratic polynomial we are looking for is x + bx +c where 


—b=444ande = (2) (4) = | [see Eq. (5.1.1)]. Now 


Qa 


pi & _ Pte? @+6P=20p C2=2-3_ 2 


—b=—+ = 7 
a fp ap ap 3 3 
so the monic quadratic equation satisfying the conditions of the problem is therefore 
x? — 2x + 1 = 0. Analternative solution is 3x? — 2x + 3 = 0. / 


Exercise 63. Let a, B be the roots of 2x? + 3x + 6 = 0. Find a quadratic equation 
whose roots are 2a and 26. 


Exercise 64. If w € R and the equation 
x?>+wx+5=0 


has two roots a and #, find the values of the quantities below without first 


calculating a and f. 
4 20 
(a) ¢ + 8 (b) a? + f? (oa? + B? (d) Hm 


Proposition 20. The quadratic equation ax” + bx + c = 0 has two equal roots if 
and only if b* = 4ac (that is, the discriminant of the quadratic is zero). 


Proof. Let a, B be the roots. Since a + B = - and aB = ©, we have 


b? — 4ac = a?(a + BY — 4a(a + B) 
= a[a? — 2aB + B?] = a?(a — BY 


Thus b? — 4ac = 0 if and only if a = £. | 


This Proposition can also be seen immediately through the quadratic formula, 
since the roots are 


_ —b + Vb? — 4ac 
~ 2a ; 


If b* — 4ac is not zero then there are either two different roots (b? — 4ac > 0) or no 
real roots (b* — 4ac < 0). 


Example 109. Suppose (m + 2)x? + 4x + (m— 1) = 0 has two distinct real roots 
and m € Z. Find all possible values of m. 
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Solution. Since (m + 2)x? + 4x + (m — 1) = 0 has two distinct real roots, we 
know that A = A(m) (which is a function of the integer m) must be positive. We 
can write this condition as 


A(m) = 16 —4(m + 2)(m — 1) > Oor 
A = 16—4(m + 2)(m— 1) = —4(m? + m — 6) = —4(m 4+ 3)(m — 2) > 0. 


Thus (m + 3)(m — 2) < 0, which means that the two factors must have different 
signs and so —3 < m < 2. Note that m = —2 is not a possible value for m since 
(m+2)x* +4x + (m—1) = 0 would then be a linear equation having only one root 
—— :. Because the problem states that m must be an integer, the only possibilities 
are m = —1,0 or 1. It is easy to check that each of these values of m works. // 


Example 110. Let a, b, and c be the lengths of the three sides of the triangle 
A ABC. If a # c, show that the equation (a — c)x? + 2bx — (c —a) = 0 has 
two distinct real roots. 


Solution. Our plan is to look at the discriminant of the quadratic given in the 
Example and show it is positive. By definition, 


A = 4b? + 4(a—c)(c —a) 
= 4(b* — (a—c)”) = 4(b— (a—c))(b4+ (a—-c)). 


Thus 
A= 4((b+c)—a)((b +a) —Cc). (5.1.4) 


Since the sum of the lengths of any two sides of a triangle is larger than the third, 
both factors of Eq. (5.1.4) are positive so that A > 0 and Theorem 19 of this section 
shows that the equation has distinct real roots. // 


Exercise 65. Assume that (a + b)x? + 2cx + (a — b) = 0 has two equal roots. 
Prove that AABC is a right triangle (where a, b, and c are the lengths of the sides 
of AABC). 


Problem Set 5.1: 


Note: You could use Synthetic Division or direct division by polynomials if 
applicable. 
1. Is (x — 2) a factor of x? — 4x? + 5x +1? 
. Is (x + 1) a factor of x3 — 5x? + 2x + 8? 
. Is (x — 3) a factor of x3 — 3x? — 13x + 15? 
. Is (x — 1) a factor of 2x? — 3x? + 14x — 1? 
. Is (2x — 3) a factor of 6x? — 19x? — 7x — 12? 
. Find s so that x + 2 is a factor of x* — 3x? + sx? +x +5. 


NNW N 


5.2 


14. 


15. 


16. 


17. 


18. 


19. 
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. Find ¢ so that x + 3 is a factor of x*+ — 7x? + tx? — 18x — 9. 

. Find r so that x + 5 is a factor of x+ + 3x? + rx? — 33x + 10. 

. Find k such that x + 1 is a factor of 2x? + 3x? —kx + 10. 

. Findr ands so that f(x) = x? + rx? +sx +7has x —3 and x + 4s factors. 
. Find k and t so that f(x) = x7 ++kx* +tx + 16 has x +2 and x —4 as factors. 
. Find m and n so that f(x) = x? + mx? + nx — 60 has x + 5 and x + 3 as 


factors. 


. An eighth-grade student, Bluma, is trying to solve the equation 5x? — 27x + 


10 = 0. She first rewrites (incorrectly) the equation as 
x? — 27x + 5(10) = 0 (5.1.5) 


and factors that equation to get x = 2 or 25. Then she realizes that she 
multiplied the constant term by five and so she divides her answers to Eq. (5.1.5) 
by 5 and claims the solutions of the original are z and 5. Are these solutions 
correct? Whether her method is correct or not is explored in Problem 14. 

(continuation of Problem 13) Let f(x) = ax* + bx +c and let g(x) = x7 +4 


bx + ac. If a and B are the roots of g(x), prove that the roots of f(x) are © 
and B (This means that Bluma’s procedure in Problem 13 is correct. See Clark 
and Millman [2].) 

Let (x + a)(x +b) + (x + b)(x +c) + (x +:c)(x +2) = 0 have two equal 
roots and assume that a, b, and c are the lengths of the sides of AABC. Then 
what kind of triangle is A A BC. (Prove your assertion.) 

Let f(x) be the monic cubic polynomial f(x) = x? + ax? + bx +c. If f(x) 
has roots a, 6, and y, find formulas for a, b, and c in terms of a, B, and y. 

If the cubic f(x) of Problem 16 is not monic, how would you modify your 
formulas? 

If m,k € Z and the quadratic equation 


x? —4(m — 1)x + (3m? — 2m + 4k) =0 


has two equal solutions, what are the possible values for m and k? 
If k,m € Z, for what values of m will the equation 


x? —4(m — 1)x + (3m? —2m + 4k) =0 


have two equal solutions? 


5.2 Rational Roots of Polynomials 


Overview The Factor or Root Theorem of the previous section (Theorem 18) 
provides the condition for existence of a root of a polynomial f(x). However, it does 
not give a way to find such a root. In mathematical language, the Factor Theorem 
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does not give a “constructive condition”, meaning that there is no procedure (or 
algorithm) contained in the proof of the Factor Theorem to show how to construct a 
root c such that x — c is a factor of f(x). You can already easily obtain the roots of 
linear and quadratic equations polynomials (or show that there are no real ones). 

Rather than talking about a test for roots, this section is dedicated to presenting 
a constructive method for actually finding rational roots of polynomials whose 
coefficients are all integer-valued (the Rational Root Test, Theorem 20 below). The 
procedure works well when the roots are rational numbers, but gives no information 
about the roots that are not rational numbers. 


Definition. If f(x) is a polynomial and r is a rational number with f(r) = 0, 
then r is called a RATIONAL ROOT of f(x). 


Notation. We denote by Z[x], the set of all polynomials in x, each of whose 
coefficients are integers. We denote by Q|x], the set of all polynomials in x all of 
whose coefficients are rational numbers. We write R[x] as the set of all polynomials 
whose coefficients are real numbers. 


Exercise 66. 


(a) Show that if f(x) and g(x) are in Z[x], then both f(x) + g(x) € Z[x] and 
F(x)g(x) € Z[x]. 


(b) Prove the same statement for Q[x]. 


Exercise 67 (Prove or Disprove). If all coefficients of f(x) are integers, then all 
of the real roots of f(x) must be integers. 


The Rational Root Theorem is a simple criterion for a polynomial in Z[x] to have 
(or not to have) a rational root. This criterion translates into a five step procedure to 
find the rational roots of a polynomial with integer coefficients. 


Theorem 20 (Rational Root Test). Let f(x) = do + ajx +++» + a,x" € Z[x] 
be of degree n. If r is the rational number *, expressed in lowest terms, and r is a 
rational root of f(x), then s|ao and t\dy. 


Proof. Since + is a root of f(x) we have 


(Qa (f teiGyTensa(em=0 6a 


This implies that 
dot" +ayst" | +++ +ay,—-\s" 't t+ a,s" = 0. 
Therefore 


n—2 


Hat tant 2 fas) = <a". (5.2.2) 
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This shows that t|a,s". But because s and t are expressed in lowest terms, 
GCD(s,t) = 1 [see Eq. (2.1.4)]. By Proposition 6, we obtain t|a, from Eq. (5.2.2) 
since t must divide either a, or s. Similarly we can use, from Eq. (5.2.1), 


ayst” | 4s + ay—18" 't + ays” = —aot”. 


The analogous argument shows that s|do. a 


Remark. Stated simply, if the rational number in lowest terms is a root of the 
polynomial, its numerator divides the constant term and its denominator divides 
the coefficient of the degree of the polynomial. 


Exercise 68. Prove the last assertion of Theorem 20. 


It is important to understand that the Rational Root Test gives only a necessary 
condition for f(x) € Z[x] to have r = ¢ as a root. This means that if r is a 
root of f(x) then s|do and t|a,. It does NOT say that every rational number } for 
which these conditions hold is a root. As an example, if f(x) = 3x? + x + 1 has 
r= f= i as a test, we have s|do and t|az, but f(%) # 0. It is a similar situation 
to that of the elections in the United States in the sense that the Rational Root Test 
is like a preliminary election (the “primaries”). Once the primary is over, there is a 
limited number of candidates (rational numbers in our case) to consider to be elected 
(to actually be roots). 

The advantage of such a necessary condition as the Rational Root Test is that 
there are not many cases to test for the rational roots. More precisely, we look 
towards those of the form r = : where s|do and tla,. That statement shows 
the tremendous advantage of this approach because we are then left to check 
mechanically a few cases as we will illustrate in Example 111. 

The Rational Root Test Theorem provides us with the following procedure to 
find a rational root of f(x) = dy + ayx +--+ + a,x" € Z[x]. We know that for a 
rational root an s must divide the constant term, ¢ the leading coefficient, and s and 


t must be relatively prime. 
Rational Root Procedure 


Step 1. Find all possible (integer) factors s of the constant term do. 

Step 2. Find all possible (integer) factors ¢ of the leading coefficient, ay. 

Step 3. Use Steps | and 2 to find all possible reduced values of *. 

Step 4. Pick one of the possible roots r = + from Step 3 and see whether it is a root 
(see the remark below). If it is, proceed to Step 5. If not, try another possible 
rational root from Step 3 until you’ve successfully found a rational root. 

Step 5. To find all rational roots you can either check all the possibilities of Step 
3 or use one successful r = + to factor f(x). Usually, it is faster to do the 
factoring and find g(x) where 


f(x) = *—r)g(a). (5.2.3) 
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The other rational roots of f(x) are then the rational roots of g(x) of Eq. (5.2.3) 
and so the same procedure may be applied to g(x). It is also useful to look at the 
factored polynomial (i.e. at g(x)) to see if you can find some other rational roots 
by factoring or using the quadratic formula. In addition, you may also be able 
to show there are no further roots by using another technique (e.g. the quadratic 
formula). 


Remark. Step 4 asks you to check whether a possible root x = r = is actually a 
root or not of f(x). There are two ways to see if r is a root. One is to compute f(r) 
and check if it is zero and the other is to divide f(x) by x — r using long division 
or synthetic division and see if the remainder of f(x) on division by x — r is zero 
or not. The choice is yours—whichever you feel will be faster. 


Example 111. Use the Rational Root Test to find rational roots of the equation 
15x? + 43x +8=0. 


Solution. If r = + is a rational solution then s|8 and ¢|15. Thus the possibilities 
are: 


s = +1,+2,44, +8 andt = £1,+3,+5,+15. 


The possibilities for 7 to be tried out are 


1 1 1 2 2 2 4 4 4 8 68 8 
Es oP Py oe SD oe oe Bo 
3.5 #15 3.05 15 3. 5 #15 3.5 #15 


which is a rather long list (but is finite). (The answer turns out to be —t and —§) // 


To see how to do this example by factoring (and to review factoring), see the 
appendix to this section. 


Remark. It is important to note that the Rational Root Test is only valid for finding 
roots that are rational numbers. It says nothing about other possibilities. As an 
example, using the rational root test on x? — 2 yields the fact that there are no 
rational roots (the only possibilities are +2 and +1). Of course, there are two roots 
(+/2)—they are just not rational numbers. In fact, we have just used the rational 
root test to reprove the following theorem. 


Theorem 21. The number V2 is not a rational number. 


Proof. The discussion of the preceding paragraph shows the only possible rational 
roots of x? — 2 are +2 or +1. As we have assumed that the square root of any non- 
negative real number is a real number, we may actually assert from Theorem 21 that 
the /2 is irrational. i 


Remark. In this section we use the foundation we’ve built (the machinery of greatest 
common divisors from Sect. 2.1) to solve problems as we did in Theorem 4 of the 
first section of Chap. 2. 
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Remark. We have given the conceptual framework for understanding why the usual 
approach to solving polynomials in the schools “through factoring of polynomials 
with integer coefficients” is actually valid. 


For Exercises 69 and 70 , you should use the Rational Root Theorem in a similar 
way. 


Exercise 69. Prove that /3 is irrational. 
Exercise 70. If p is a prime number, prove that ,/p is irrational. 
Example 112. Find the rational roots of f(x) = 2x? — 5x? — x +6. 


Solution. We follow the Rational Root Procedure previously outlined, looking for 
rational roots of the form +, where s and f are relatively prime. 


Step 1. In order to find possible values of s we must find all possible factors of 
dao = 6. This results in s € {+1, +2, +3, +6}. 

Step 2. Find all possible values t we look for factors of a, = 2, which gives t € 
{+1, +2}. 

Step 3. To find all possible values of +. There are only 12 possible rational roots. 
They are +1, +2, +3, +6, +5, and +3. 

Step 4. Choose one of the possible values of + from Step 3. We’ll try x = 1 since it 
makes the arithmetic easy. Unfortunately for us, f(1) = 2 so x = 1 is not 
a root. 
Choosing another possible value, we will now try x = —1. In this case, 
f(-1) = 0 so —1 is a rational root. We may therefore divide f(x) by 
x —(—1) = x + 1 and obtain 


f(x) = 2x? — 5x? —x +6 = (x + 12x? -—7x +6) +0 
= (x + 1)(2x — 3)(x — 2) 


where the quadratic term factors easily as (2x — 3)(x — 2). Therefore the 
three rational roots of f(x) are x = —1, 3, and 2 since f(x) is of degree 3 
and so there are at most 3 roots, we have found all (three) of the roots. 


It is certainly true that in Example 112, we were “lucky” when our second guess 
(x = —1) turned out to be a root (and hence we could factor x +1 out from f(x) and 
finish the problem easily). We’ll now revisit the same problem to give a useful and 
more methodical way to eliminate some of the candidates for s and t. That approach 
is based on the Factor/Root Theorem (Corollary 18). 


Example 113 (Example 112 Revisited). Find the rational roots of f(x) = 2x? — 
5x? —x + 6. 


Solution. Proceeding to Step 3 above gives 12 possible values of +. However, from 
the Factor Theorem we know that f(x) has the linear polynomial (x —c) as a factor 
if and only if f(c) = 0. Thus, if + is aroot of f(x), there is a polynomial g(x) such 
that (x - a) g(x) = f(x), or 


154 5 Factoring Polynomials, Their Roots, and Some Applications 


(tx — s)g(x) = tf (x). 
We therefore see that if + is a root of f(x) then, f(x) is divisible by 
(tx —). (5.2.4) 


It is this last statement that allows us a further restriction of the possibilities for 
> in our example. Since f(1) = 2—5—1+6 = 2andtx—s =t—s whenx = 1, 
it must be that (¢ — s) divides 2. That equation allows all of the possibilities for > 
except +6 and —5. We have eliminated 3 of the 12 possibilities. What information 
does an application of (5.2.4) with x = —1 yield? // 


Example 114. Let a,b € Z. If f(x) = x? + ax? + bx + 3 has 3 distinct rational 
roots, find the values of a and b. 


Solution. By the rational roots theorem, we know that the possible candidates for 
rational roots are +1, +3. Since there are 3 distinct rational roots, and four choices, 
we write 4 equations with a and b for each of the possibilities +1, +3. 

If x = 1 is aroot of f(x) = 0, then 


l+a+b+3=0ora+b=—4. (5.2.5) 
If x = —lisaroot of f(x) = 0, then 
—-l+a—b+3=0ora—b=-2. (5.2.6) 
If x = 3 is aroot of f(x) = 0, then 
27+ 9a+ 3b+3=00r3a+b=-—10 (5.2.7) 
If x = —3 is aroot of f(x) = 0, then 
3a—b=8. (5.2.8) 
The first two of these equations gives that a = —3 and b = —1. Does a = —3 and 
b = —1 solve either (5.2.7) (in which case x = 3 is a root) or (5.2.8)? Yes, a = —3 
and b = —1 satisfy (5.2.7) so the roots are x = 1,—1 and +3 anda = —3 and 
b=-1. 


If we had not been successful in finding the roots in our first guess, {1,—1, 3}, 
we would have tried all 4 possibilities until one actually worked. // 


Example 115. Letm,k € Z. Assume the two real roots of the following equation 
are equal and in fact are rational numbers. 


f(x) = mx? + (m+ 1Ixt+(k +1) =0 (5.2.9) 
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(a) Prove that k(k + 1) must be the square of an integer. 
(b) Find the possible values of m and k. 


Solution. For Eq. (5.2.9) to have equal roots, its discriminant must be zero. In 
equation form, this means: 


A(m) = (m + 1)? —4m(k +1) = m? —- (24+ 4k)m +1 = g(m). (5.2.10) 


Let us regard A(m) as a quadratic polynomial in m which we’ll call g(m). Since 
we are searching for solutions to (5.2.10) with m and k integers, it must be that the 
solutions to g(m) = 0 are integers. Thus the quadratic formula applied to (5.2.10) 
yields 


malt 4k) + = 4k)? —4 ror 
_ (2+4k) + /16(k + k?) 
~ 2 
= (14+ 2k) t2Vk4+k?. 


Thus &(A + 1) must be a perfect square. If k(k + 1) = 0, thenm = 1 + 2k and 
either k = 0 or k = —1. Thus m = 1 orm = —1 and there are two possibilities: 


f(x) =x? +2x+1(k =0) or f(x) = x? (k =—1). (5.2.12) 


We’ ll now solve Part(b) and show that there are no other solutions. If k(k + 1) isa 
positive perfect square it has a decomposition into prime factors as 


k(k +1) = ptt-+ pr (5.2.13) 


where each of the p; are primes and 1,,...,¢ are even integers. Since k andk + 1 
are relatively prime, they can have no common factors, in particular, no common 
prime factors. Thus Eq. (5.2.13) shows that both k and k + 1 would be the product 
of primes, each an even number of times. However, both & and k + 1 must both be 
squares (and positive) and this isn’t possible. This can happen if they are | and 2 (as 
there are no other consecutive squares). However, when k = 1,k(k+1) = 2, which 
is not a perfect square. Thus Eq. (5.2.13) gives the only solutions to the problem: 
k =Oandm = 1lork =—landm=-l. // 


Once again we have solved a problem by using a foundation built in earlier 
sections. 


Exercise 71. Assume g(x) = x? +ax? + bx +6 € Z[x] has three distinct rational 
roots. Find the values of a and b (if any exist). 
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Problem Set 5.2: 


1. Which of the 12 possibilities of Step 3 of Example 112 are eliminated if one uses 
(5.2.4) with x = 2? 

. Find all rational roots of f(x) = x? — 6x? — 7x + 60. 

. Find all rational roots of h(x) = x? + 27. 


. Find all rational roots of f(x) = x =o = x + 20. 
. What are all rational roots of f(x) = 3x° + 5x? — 16x — 28? 
. Show that the polynomial 3x? — 3x + 1 = 0 has no rational roots. 


. Let f(x) = x* — 3x? -6. 


NYDN WH 


(a) Find all rational roots of f(x). 
(b) Find all roots of f(x). (Of course, depending on your response to Part A, 
you may have to use methods that are not in this section.) 


8. Using the Rational Root Test, find all possible rational roots of the following 
(you are not expected to find all rational roots, just the possibilities given by the 
Rational Root Test). 


(a) 9x4 — 24x3 — 2x? -24x +9=0 
(b) 6x* + x3 + 14x? - 14x +3 =0 
(c) 2x3 + 3x7?-2x+3=0 


9. Assume f(x) = 9x4 +ax3+bx? +cx +1 € Z[x] has 4 distinct rational roots. 
Find the values of a, b, and c. 


5.2.1 Appendix to Sect. 5.2: A Brief Review of Factoring 
Quadratics 


Remark. With some practice you will begin to learn how to pick the right numbers 
without exhausting all possible cases, thus saving a substantial amount of work. 


Remark. The quadratic equations used in this method were carefully selected so 
that they could be solved by factoring. There are, however, many quadratics that can 
not be solved in this way. For example, the equation x? — 8x — 3 = 0. 


Example 116. Solve 15x? + 43x + 8 = 0. Herea = 15,5 = 43, andc = 8. 


Solution. Steps: 


1. Find all possible pairs of factors of a and c. The possibilities are : 
a=15=15-1=5-3andc =8=8-1=4-2. 


2. Use of a pair of integers which factors a = /m and write (4/x+_ )(4mx+_ ). 
For instance, using the pair of 15 and 1, write the form (15x + _ )(x + _ ). 
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3. Try all possible pairs of factors of c to fill the blanks in step 2. First we try 8 and 
1 in the blanks, both ways, namely (15x + 8)(x + 1) and (15x + 1)(x +8). Try 4 
and 2 in the blanks, both ways, namely (15x + 4)(x + 2) and (15x + 2)(x + 4). 
(You should also do the same for the case in which c is factored as (—2)(—4) or 
(-1)(-8).) 

4. Check the middle term of each product in step 3. If the value of b (b = 43 in 
our example) which is the middle term of the product in not equal to 43x for any 
possibility, then go to step 2 again. It is, unfortunately true that in this example 
the middle term does not work out for any of these four possibilities. 

We now go back to step two, factorizing a = 15 as 15 = 5-3. We consider the 
pairs 5 and 3, write (5x + _ )(3x-+_ ). Once again, try 4 and 2 in both ways, and 
8 with 1 both ways. We try (5x + 4)(5x +2), (5x+2)(3x+4), (5x+1)(3x+8), 
(5x + 8)(3x + 1). The middle term of the product of (5x + 1)(3x + 8) is 43x, 
hence (5x + 1) and (3x + 8) are the solutions for the factors, and we are finished 
without having to check the factors (—4)(—2) or (—8)(—1). 

5. Solve each quadratic equation by factoring: 


(a) x*-—7x+12=0 
(b) 6x7 +x-15=0 
(c) 2x? —7x —30=0 I 


5.3. Greatest Common Divisors and Least Common 
Multiples for Polynomials 


Overview We continue with results for polynomials which are similar to those of 
Chap. 2 for natural numbers and integers. We first define the GCD and LCM of 
polynomials which are parallel to that of the GCD and LCM of integers (Table 5.1). 
Corresponding to the Division Algorithm for integers (page 17 of Sect. 1.1), 
there is a division algorithm for polynomials (Theorem 15). Therefore, it should 
not be surprising that there is also a Euclidean Algorithm for Polynomials to find 
the greatest common divisor of two polynomials which we will now define and 
then prove the algorithm. This procedure comes from the division algorithm for 
polynomials, just as the Euclidean Algorithm for integers (Theorem 7) comes from 
the Division Algorithm for integers. The proof of Theorem 22 is exactly the same 
as that of Theorem 7 but with polynomials instead of integers and we will omit it. 


Table 5.1 Example 116 15,1 8,1 
5,3 4,2 
—15,-1 | —8,-1 


5-3 | -4,-2 
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Definition. Let f(x) and g(x) be non-zero polynomials over R. A monic 
polynomial d(x) over R is the GREATEST COMMON DIVISOR (GCD) OF f(x) 
AND g(x) if 


(i) d(x)| f(x) and d(x)|g(x) 
(ii) if k(x) € R[x], k(x)| f(x), and k(x)|g(x), then k(x)|d(x). 


The greatest common divisor of f(x) and g(x) is usually denoted by 
GCD( f(x), g(x)) = d(x) which means that d(x) is a common divisor of both 
F(x) and g(x) [Part (i)] and d(x) is the largest common divisor [Part (ii)]. 

The Euclidean Algorithm for polynomials (Theorem 22 below) is similar to the 
Euclidean Algorithm for finding the GCD of non-zero integers just as the definition 
of GCD( f(x), g(x)) mimics the definition of the GCD(a, b) if a and b are non-zero 
integers. 


Definition. If GCD( f(x), g(x)) = 1, then f(x) and g(x) are called RELATIVELY 
PRIME POLYNOMIALS. 


Theorem 22 (Euclidean Algorithm for Polynomials). Let f(x) and g(x) be 
polynomials over the R such that deg f(x) => deg g(x). Then applying the division 
algorithm gives a sequence of quotients (qo(x) qi(x),...) and remainders (r(x) 
r\(x),...) such that 


f(x) = qo(x)g(x) + ri(x), deg ri(x) < deg g(x) 

g(x) = qi(x)ri(x) + r(x), deg ro(x) < deg ri (x) 

r(x) = qo(x)ro(x) + 73(x), deg r3(x) < deg ro(x) 
Tn—2(X) = Gn—1(X)Pn—1(X) + Tn (X), degr,(x) < deg r,—1(x) 


Tn—1(X) = Gn(X)T n(x), 

where r(x) is the last non-zero remainder. If a is the leading coefficient of r(x) 
then GCD(f (x), g(x)) = a7! r,(x). 
Example 117. Find GCD(x> + 6x? — 49x + 42, x4 + x3 — 9x? — 3x + 18). 
Solution. We use division of polynomials and follow the procedure and notation of 
Theorem 22 with f(x) = x>+6x?—49x +42 and g(x) = x4+2x3—9x?—3x4+ 18. 
F(x) = q(x)g(x) + r(x) is found by division to obtain: 
x° + 6x? — 49x + 42 = (x — 1)(x* + x3 — 9x? — 3x + 18) + 10x? — 70x + 60 

—— =—«___"" 

qo(x) ri (x) 


Using the notation of the theorem, the above equation gives Theorem 22 for which 
g(x) divided by 7;(x) and comes up with 
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Table 5.2. GCD(x° + 6x? — 49x + 42, x4 + x3 — 9x? — 3x + 18) 


x x? + Oxt + Ox? + 6x? — 49x + 42 | x* + x3 — 9x? — 3x + 18 px 
x? + x4 — 9x3 — 3x? + 18x x* + Ox? — 7x? + 6x 

=1 | —x4-+ 9x3 + 9x? — 67x + 42 x3 — 2x? — 9x + 18 - 
—x4— 1x3 + 9x? + 3x —18 x3 —7Tx+6 

—5x | 10x? + 0x? — 70x + 60 —2x? —2x + 12 
10x? + 10x? — 60x 

+5 | —10x?— 10x + 60 
—10x? — 10x + 60 
0 


1 
x bg? = 0x =3x bh 1s= Ties 1) Cte = 70% 60) + (247 = 3x = 12) 
=—e__——_—___——"" —__———_—— 


ae gi(x) r(x) 


so that gi (x) = p(X + 1) and ro(x) = —2x? — 2x — 12. Similarly, 


10x? — 70x + 60 = (—5x + 5) (—2x? — 2x + 12) +0 
a 
q2(x) r2(x) 
so that qo(x) = —5x + 5 and r3(x) = 0 and the process finishes in 3 steps. The last 
non-zero remainder is r2(x) = —2x? — 2x + 12 whose degree is 2 (and r3(x) = 0). 


Since the GCD of two polynomials must be a monic polynomial, we divide the 
remainder by —2 to obtain 


GCD(x> + 6x? — 49x + 42, x4 + x3 — 9x*-— 3x4 18) =x? +x-6. 
// 
The above calculation can be rewritten as follows (Table 5.2): 
Example 118. Find the GCD(x* + 3x — 2,2x3 + x7 + x + 6). 


Solution. Using the Division Algorithm multiple times (as in the previous 
example), 


1 1 1 i 4 
x643x-2= (5*-3) (2x3 + x7 -+x+6)+ (-j"+ i*-3) 
(5.3.1) 


2x? +x? +x =—6 = (—8x = 12) sg aoe +0 
4 4 2 


Then r,(x) = —}x* + 4x — 5 and hence GCD(x‘ + 3x —2,2x3 + x7+x+6)= 
x? —x +2. // 
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Again the above calculation can be rewritten in table form as follows: 


$x x44 0x3 + 0x27 +3x—-2 | 2x8 +x?7+x+6 | —8x 
xt 5x3 as 5x? + 3x 2x3 — 2x? + 4x 

i Ti 2 3x2 —3x+6 | —12 
ae $x? res 3 3x? — 3x +6 
axe + ix— 4 0 


Hence GCD(x*+3x—2, 2x?+ x7 +x+6) = x?—x+2 since txt + ix—4 is 
not monic. In order to avoid arithmetic involving fractions, we can do the calculation 
in the following way: 


x x4 + 0x3 + Ox? + 3x —2 23+ x2? +x+6 2x 
2x4 + 0x3 + Ox? + 6x —4 | 2x3 — 2x2 + 4x 
xt + x3 + x2 4+ 6x 


—l —x3 — x? —4 3x? — 3x +6 —3 
—2x3 — 2x? —8 3x2 -— 3x +6 
—2x3 — x? —x-6 
x?-x+2 0 


Exercise 72. Find the following GCD( f(x), g(x)) if 


(a) f(x) =x44+ x3 -x? +x —2and g(x) = x31. 
(b) f(x) =x* +23 —x? +x —-2, g(x) = 3xt-— x3 —x? —x. 


Exercise 73. Let c € R, f(x) = 2x? — 4x? + 2x + (2c + 4), and g(x) = 3x3 — 
6x? + (3c + 5). If GCD( f(x), g(x)) is a linear polynomial, then what is the value 
of c? 


There was an easier way in Chap.2 of computing the GCD of the integers a 
and b. That approach involved proving that there are integers x9 and yo such that 
GCD(a, b) = xoa + yob. That result was called the Euclidean Byproduct Theorem 
(for integers) and is Theorem 8. Between the integers and polynomials, we now 
show that a similar type of result works for the GCD of two polynomials. 
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Theorem 23 (Euclidean Byproduct for Polynomials). [f f(x), g(x) € R[x] and 
are not zero, then there are real polynomials a(x) and b(x) such that 


GCD( f(x), g(x) = a(x) f(x) + bx) g(x) 


Example 119. Find polynomials a(x) and b(x) such that 


GCD(x* + 3x — 2,2x37 + x? + x + 6) = a(x) (x* + 3x — 2) th(x) (2x7 + x7 +x46). 
—_—"" _—_—_—_—_—_——" 
f(x) g(x) 
(5.3.2) 


Solution. From Example 118, we know that GCD( f(x), g(x)) = x7 — x +2. 
Equation (5.3.2) is identical to 


i. 4 1 1 1 
wt 3x-2=(Fx- 7) O04 et tx 464 (—j2 + Gx-5). 


Thus, we have 


A(x* + 3x —2) = (2x —1)(2x3 + x7 +x +6) + (—x? +x —2) or 


fag = 4 3x S20) S0e KS 1x ex Se £6). 
—— —<<$<$<_,_ —_—_—_<_<— 
f(x) g(x) 
Hence a(x) = —4 and b(x) = 2x - 1. // 


Exercise 74. Continuing Exercise 72A, find a(x), b(x) € R[x], such that 
GCD(x* + x3 +2x? 4x41, x3=-1) = a(x) (x4 +42 42x? 4+%41)4+b(x)(x?-1). 


Recall that for integers a, b, and d, if dja and d|b then d|(ka + ¢b) for any 
k,£ € Z. (Exercise 2, Sect. 1.1). The statement which follows (Proposition 21) is 
the analogy in polynomials to the statement for integers just mentioned. Its proof is 
left as Problem 2 of Problem Set 5.3. 


Proposition 21. If d(x), a(x), and b(x) are non-zero polynomials with d(x)|a(x) 
and d(x)|b(x), then d(x)|k(x)a(x) + £(x)b(x) for any polynomials k(x) and £(x). 


Example 120. Let f(x) = ax? + bx + 1 and g(x) = bx? + ax + 1. If the 
GCD( f(x), g(x)) is a linear polynomial, what is a + b? 
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Solution. The key to our method is to consider f(x) — g(x). By virtue of 
Proposition 21 with k(x) = 1 and é(x) = —1, the GCD(f, g) must divide 
F(x) — g(x). But 


f(x) -— g(x) = (a- b)x? + (b—a)x = (a—b)x(x—-1). 


Because we’ve assumed that the GCD( f(x), g(x)) is linear, it is either x or x — 1. 
We now show that GCD( f(x), g(x)) # x. If x divides f(x) = ax? + bx +1 
then there is a g(x) € R[x] with 


f(x) = ax? + bx +1 = xq(x). 


But then f(0) = 1 = 0q(0) = 0, which is impossible. 
Thus the GCD( f(x), g(x)) = x — 1. But, since GCD( f(x), g(x))| f(x) (and 
g(x) also) there is a polynomial p(x) with 


ax? +bx+1= (x—1)p(2). 
Evaluating the above when x = 1, we have a + b + 1 = 0 which gives the desired 


result of —1. // 


Example 121. If f(x) = x°° — 2x? — 1 and g(x) = x® — 3x? — 4, find 
GCD( f(x), ¢(x)). 


Solution. If d(x) isa common divisor of f(x) and g(x), then Proposition 21 shows 
that d(x)|(f (x) — x?g(x)). However 


f(x) — x°g(x) = (-2x? — 1) — (-3x* — 4x7) 
= 3x4 4 2x? —1 = (3x? - 1)(x? + 1). 
Thus any common divisor, and in particular the greatest common divisor, must 
divide (3x? —1)(x? + 1) and so the GCD( f(x), g(x)) is either +(3x?—1) or x? +1. 


We show that the first possibility doesn’t work. If 4 (3x? —1)| f(x) then f (=) 
must be zero. However, 


H)= (a) 6) 0) 


which is certainly not zero. Thus the GCD( f(x), g(x)) = x? + 1. // 


Definition. Let f(x) and g(x) be two polynomials over R. The monic polynomial 
c(x) is said the be the LEAST COMMON MULTIPLE of f(x) and g(x) if 


(i) f(x)|e(%), g(x)|e(x), and 
(ii) if for any e(x) € R[x], whenever f(x)|e(x), g(x)|e(x), then c(x)|e(x). 
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The least common multiple (LCM) for f(x) and g(x) will be denoted 
by [f(x), g(x)] = LCM(f(x), g(x)) and (f(x), g(x)) will be used for the 
GCD(f(x), g(x). 


Proposition 22. [f f(x) and g(x) are non-zero polynomials then 
f(x)g(x) = LCM(f(x), 8(x))GCD( f(x), g(x). 


Proof. Let d(x) = GCD(f (x), g(x)). Then we have, since d(x) must divide both 
F(x) and g(x), polynomials k;(x) and k(x) such that 


f(x) = d(x)ki (x) 
g(x) = d(x)ko(x) where GCD(k, (x), ko(x)) = 1 


From the previous equations, f(x)|d(x)k,(x)k2(x) and g(x)|d(x)ki(x)k2(x). 


Therefore LCM( f(x), g(x)) = d(x)ki(x)ko(x) = fee) = aoe 
and 


F(x): g(x) = LOM(f (x), 8(x))GCD( f(x), g(x). = 


The above equation can be use to find LCM( f(x), g(x)) if f(x) and g(x) are 
given. Namely, we find GCD( f(x), g(x)) first. 


Example 122. Find LCM(x‘ + 3x —2,2x3 + x2 +x +6). 

Solution. From Example 118, we know GCD(x* + 3x — 2,2x3 + x7 +x+6)= 

x? — x + 2. Hence 

(c* 4 3 = 2)? 4 a 4+ 6) 
x2—-x+2 

= (x* + 3x — 2)(2x + 3). 


LCM(x* + 3x —2,2x9 + x7 +2x%+6 = 


// 


Exercise 75. Find LCM(f(x), g(x)) if f(x) = 4x4 — 4x3 + 5x? — 4x + 1 and 
g(x) = 8x — 6x? + 5x — 2 using Proposition 22. 


Problem Set 5.3: 


1. Find GCD( f(x), g(x)) and LCM( f(x), g(x)) if 


(a) f(x) = x? + 4x? — 8x + 24 and g(x) = x° — x4 + 8x? — 8x. 
(b) f(x) = x* + x3 — 9x? — 3x + 18 and g(x) = x° + 6x? — 49x 4+ 42. 
2. Prove Proposition 21. 


3. Let f(x) = x?+ax+be and g(x) = x?+bx-+ac, and let GCD( f(x), g(x)) = 
x —d. Find LCM(f (x), g(x)). 
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. Let f(x) = x? — 2x? + ax —3, g(x) = x* — bx + 2, where a,b,c € N, and let 


GCD(f (x), g(x)) = x —c. Find LCM(f (x), g(x)). 


. Let f(x) = x* + ax +b and g(x) = x? + bx +a where a,b € R. If the 


GCD(f (x), g(x)) is linear, find LCM( f(x), g(x)). 


Chapter 6 
Matrices and Systems of Linear Equations 


Introduction This chapter discusses the solutions of a system of linear equations 
by viewing the solutions as a set and interpreting that set geometrically. Matrices and 
operations on matrices are briefly reviewed and their effect on the related solution 
sets is also described geometrically. 


6.1 Matrix Operations 


Overview The key conceptual point and reason that matrices are so important is the 
relationship between matrices and the solution set of a system of linear equations 
which, will be discussed in the next section. Because matrices and their operations 
are introduced in school mathematics (for example, in Algebra 2 courses in high 
schools in the United States), we will not provide much detail in this section. 

We will pay attention mostly to the cases of 2 or 3 dimensional Euclidean space 
(also called 2-space or 3-space) or equivalently, 2 or 3 equations in 2 or 3 unknowns 
because the basic ideas of the subject are in 2 and 3 space. Everything extends easily 
to higher dimensions but that generality is one taken in a college level course “linear 
algebra”. Restricting to 2 or 3 dimensions saves some formalism and much notation 
which allows the underlying ideas of the subject to be clearer. 


Definition. An (7m x1) MATRIX is a collection of real numbers arranged in m rows 
and 1 columns. The SIZE of such a matrix is m x n. The m x n ZERO MATRIX is 
one in which all entries are zero. We write 0 for the zero matrix of any size. 


We will write a capital letter, say A, for the matrix. The ith row and jth column 
entry of A is denoted by a;;. 
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Definition. Two matrices are EQUAL if they have the same size and their corre- 
sponding entries are the same. In symbols, two m x n matrices A and B are equal if 


aij = by forall l <i <mand1 <j <n. 


A matrix, A, can be MULTIPLIED BY A REAL NUMBER A by multiplying each 
of its entries by 4. More formally we have the following definition of scalar 
multiplication. 


Definition. If A = [a;;] is anm xn matrix and A is a real number, then the SCALAR 
MULTIPLICATION JA is the m x n matrix with entries defined by the (i, j) entry of 
AA is Aaj;;, or 


(AA);; = Aaj; foralll <i <mand1 <j <n. 

The resulting matrix, 1A, is the matrix whose (i, j ) component is A.q;,;. In other 
words, the components of 1A are exactly the components of A multiplied by A. 
Definition. In symbols, if A and B are two m x n matrices then their SUM C = 
A+ B is the matrix with entries defined by 

cy = aj + bj forall <i <mand1 <j <n. 

If A and B are matrices of the same size then SUBTRACTION of B from A is 

defined to be A+ (-1I)B = A- B. 


The matrix addition and scalar multiplication can be viewed as component 
wise operations. This reduces the matrix operations to performing addition or 
multiplication of real numbers. 


Exercise 76. For any matrix A, prove that the zero matrix 0 is the additive identity 
and —A = (—1)A is the additive inverse of A. More precisely, for any matrix A, 
prove that: 


1 A+0=A 
2. A+(—A) =0. 


This exercise can be done easily component-wise. 


Definition. If A is n x n then its DIAGONAL elements are a;; where i = 1,...,n. 
A matrix A is called a DIAGONAL MATRIX if all non-diagonal entries are zero. 


One very special and useful diagonal matrix is the identity, /. 
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Definition. The m x n matrix J in which each element on the diagonal is 1 and 
all others are zero is called the MULTIPLICATIVE IDENTITY or, more simply, the 
IDENTITY. Note that J can be written component-wise 


1 ifi=j 
0 ifi x7 
where 1 <i <nandIl <j <n. 


To define matrix multiplication, it is convenient to first introduce a symbol to 
show the sum of a certain finite set of numbers {x;}. 


Definition. If x, is a real number for each integer k between £ and n inclusive, then 
n 


the symbol me x; is the SUM of {xx} for k between £ and n inclusive, or 
t 


n 
Soe = Xe X41 Het Xn-1 + Xn. 
e 


The left hand side of the previous equation is called “summation notation”’. 


Exercise 77. If x, = k show that 


n 


Sue mee 


k=1 


by using Gauss’s insight about arithmetic progressions. 


Exercise 78. Looking at a high school algebra text, use summation notation, and 
arithmetic and geometric progressions to express the following sums. 


A. The sum of an arithmetic progression whose first term is a and whose common 
difference is d. 

B. The sum of a geometric progression whose first term is a and whose common 
ratio is r. Assume that r # 1. 


We now turn to the definition of matrix multiplication. 


Definition. If A and B are matrices in which the number of columns of A is the 
same as the number of rows of B, then the PRODUCT of A and B, AB, is the matrix 
C = (cj;) where 


P 
Ci = > Aik bx; . 
k=1 
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Note that if A ism x p and B is p xn thenC = AB ism xn. Furthermore, these 
formulae in the 2 x 2 case are 


(6.1.1) 


EB - ie | _ Bee + aj2b2, ayb12 + a 
a Ar | | br ba aby, + d22by1 a2b12 + abr 


which is the usual result. 


It is important to note that unlike matrix addition or scalar multiplication of real 
numbers, matrix multiplication can not be viewed as a component wise operation 
and it is not necessarily commutative (if m > 1); that is, it may be that AB #4 BA 
for certain matrices A and B. 


Exercise 79. 


A. If Ais ann x n matrix, prove that 7A = A and AI = A. 
B. Show that matrix multiplication is not commutative. More precisely, find 2 x 2 
matrices, A and B such that AB # BA. 


The inverse of a matrix is very helpful in applications to numerical analysis, 
number theory, geometry, and has many uses outside of mathematics. It can be 
thought of as similar to taking the inverse of a real number. Just as nearly all real 
numbers a have an inverse (in fact, for all a 4 0), so do inverses exist for most but 
not all matrices. 


Definition. The 1 x n matrix A has a MULTIPLICATIVE INVERSE, written as A7!, 
if there exists a matrix A! such that 


AA '=A TAHT. 


If A has a multiplicative inverse, A is called INVERTIBLE. 


Exercise 80. If A = [ 6 7] show that 


_ 1 -} 
A ce I. 
2 2 


We will now associate to each matrix A a number, the determinant of A (written 
det A) which will give us a precise criterion for a matrix to have an inverse and also 
give a formula (Theorem 24) for the inverse of A. Knowing which matrices have 
inverses and how to compute the inverse is a subject of great importance in higher 
mathematics and its applications. 


Definition. If A is the 2 x 2 matrix 
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then the DETERMINANT of A is the number 


ab 
det A = =ad —be. 
cd 
If A is the 3 x 3 matrix 
Qi1 412 A413 
A = | ao) ax a3 
431 432 433 


then 


Q\1 412 443 
det A = Jay) doz d23) = 411422433 + 412423431 + 413421432 
31 432 A433 


— (441473432 + 412421433 + 13422431). (6.1.2) 


The comments of this paragraph sketch the general definition of the determinant 
and can be skipped on the first reading of the text. The above definition is valid for 
nXn matrices when n = 2 or 3 only. A careful look at the 3 x3 case hints at what the 
definition of det A should be if n > 3. Note the first step in the computation shows 
that det A is obtained by going down the first column and multiplying the (i, 7) 
coefficient by the determinant of a 2 x 2 matrix which is obtained by crossing out 
the i-th row and the j-th column. The det A is obtained by computing the alternating 
sum of those numbers, i.e. 


a a a a a a 
det A = aj, det ee ee — aj det si + a3 det casei 
432 433 431 433 431 432 


= (411422433 + 412423431 + 413421432) — (411423432 + 412421433 + 413422431) 


The last two sentences show how one would generalize the definition of determi- 
nants to the 4 x 4 case and beyond. Since we only need the 2 x 2 and 3 x 3 cases 
in this text, we will not go beyond n = 3. Furthermore, since this book is not a text 
in linear or matrix algebra, we will only use the determinant to compute inverses in 
Theorem 24 and state Cramer’s Rule in Corollary 9 and Sect. 6.4. You have already 
seen Cramer’s Rule in high school. Here we are making careful definitions showing 
that Cramer’s Rule actually works for n = 2 andn = 3. 


Remark. There are similar expression for a 3 x 3 determinant by expanding by any 
row or column. 
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Exercise 81. Compute the determinant of the following matrices. 


aan 


1-2 4 
2. |1 2-2 
O-1 3 
123 
3. 1456 
789 


The next proposition is especially useful when trying to calculate the determinant 
of a matrix. The proof in the case of n = 2 is left as exercise. The 3 x 3 case can 
be done the same way (which is sometimes called “by brute force’) although in 
advanced courses easier ways are used. We will not ask for a proof when 1 = 3 or 
higher. 


Proposition 23. If A and B are bothn x n then det(AB) = det A det B. 
Exercise 82. Prove Proposition 23 forn = 2. 


Definition. If A is an m x n matrix then A’, called the TRANSPOSE of A is the 
n X m matrix whose (i, /) entry is the (j,i) entry of A. 


As an example, note that if 


then 
1-3 
A'=|2 4 
3.5 


If A is square, then A’ merely reflects the matrix across its diagonal, or, said 
another way, A? =“flipping A across its diagonal”. The next proposition shows 
that the transpose of a product is the product of the transpose in the opposite order. 
(Remember, order matters for matrix multiplication!) 


Proposition 24. If A, and A» are both square matrices of sizen x n whenn = 2 
or 3, then 


(A, Az)" = 427A)". 
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Proof. In the 2 x 2 case, we let 


then 


So being careful of the order: 


Ast AT = Oem + 2b, age, + cody 
boa, + dob, bocy + did 


and 


ete he aa. + bic. = aybo + by dy 
Lee c1a2 + d\c2 boc, + di dy : 


Thus (A; Az)! = Ay" A). 

The proof of the case when n = 3 follows along the same lines (by brute force), 
but needs summation notation and matrix entries written with their usual double 
subscripts. a 


In order to write out a formula for the inverse of a 3 x 3 matrix (when one exists) 
we first need the definitions of cofactors and the minors of a matrix. 


Definition. Let (i, 7) be a fixed pair of integers and write 


Qi 412 413 
A= | aa, ar 23 
431 432 433 


Then the (7, 7) minor of A, denoted Mj; is the determinant of the matrix formed by 
crossing out the i” row and j“” column of A. The COFACTOR ci; of A is given by 
cy = (-1)'T Mj. 
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Example 123. Let 


23 1 
A=|04 2 
12-1 


Find the cofactors, ¢11, C12, C23. 


Solution. To find the minor M),, we delete the first row and first column of A and 
evaluate the determinant of the resulting matrix: 


23 1 
A=|04 2 Mu =} i|=-#-4=-8 
12-1 
Then the cofactor c;; = (—1)!*!M,, = —8. Similarly, we have: 
ey = (-l)'P Mp = - De 2, 
1-1 
ce 23 
= (-1)°73Mp3 = — =-1, 
e233 = (-1) 23 ; 7‘ 


// 


Exercise 83. Find all cofactors of the matrix A from Example 123. (There are a 
total of 9, can you see why?) 


We now have the background to write out the inverse of a matrix. 
Theorem 24. If A is a2 x 2 or 3 x 3 matrix then 
(i) A is invertible if and only if det A # 0. 
(ii) (2 x 2 case) If A=[%' P'] and det A # 0 then 


az b2 


Am : ie Par (6.1.3) 


~ det A -a. a 


(iii) (3 x 3 cases) If det A ¢ 0 and cjj is the (i, j) cofactor of A, then 


T 
1 C11 C12 C13 
Al =——|oe cy | - (6.1.4) 
det A . 
C31 C32 C33 
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Proof. We first note that if A has an inverse, say A! = X,then AX = I. Thus, 
from Proposition 23 


(det A)(det X) = det/ = 1 


and so det A ¥ 0. To finish Part (i), we must now show, in the case in which n = 2 
or 3, that det A ~ 0 implies that A is invertible. 

In fact, to show that A is invertible we can just write down a formula for AT}, 
Equations (6.1.3) and (6.1.4) give exactly such formulas so we need only show that 
the right hand side of those equations when multiplied by A give J. We will now do 
the case when n = 2. 


A 1 bo —b aa 1 a, by bo —b 
detA}—a. a,;|  detAlarbo||—a a 
1 aybo — byao 0 
det A 0 aybz — bya2 


The same computation in the opposite order gives A~!A = I also, which completes 
the case whenn = 2. 

The case in which n = 3 gets bogged down in notation and we will omit it as it 
is just a straightforward computation (by brute force). a 


Example 124. Find A~! by using Theorem 24 where 


5-41 
A=|-4 62 
1-43 


Solution. Here we will compute the nine elements of the cofactor matrices of A: 


C1, = (—1)'T! det & | = 26, 


—42 
Cp = 1)? | ' i = 14, 
c13 = (-1)'*3 det Cs " = 10, 


—41 
cy = (—1)**! det & = 8, 


51 
Cy = 1)? det |t ; = 14, 
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5-4 
C73 = (—1)7*3 det F | = 16, 


c31 = (-1)°*! aet| s|=-1 


3 = (—1)3*7 det 


ir 


J. 


4 
62 
5 
— 42 |=! 
5 = 
—4 
5-26—4-144+1-10= 84. 


det A = a11¢1) + di2C12 FC = 
Hence, according to Theorem 24, 


26 1410]' 26 8 —14 


A'=—] 8 1410] =—1]1414-14 
a4) iat ia ae 1016 14 


// 


Problem Set 6.1: 


_ 


. Find the sum or product where possible. 


22 


(b) [-3 4] ] 32 1 
5611 
102 
(c) 2[314]+[12-1]| 003 
-104 


2 
31 31 10 
d =5 6 
© [ta] Lad] + loi 
2. If A= [27 _!], find AT and A™! if they exist. 
3. Prove each of the following statements either in the n = 2 case orn = 3 


depending on your instructor’s wish. All matrices are square of size n x n. 
(a) A(B+C)=AB+AC,AER 


(b) (AB)C =A(BC),AER 
(c) A(B+C) = AB+AC 


6.1 


lon 
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. Give an example to show, in the n = 2 case, that for n x n matrices A, B, C, 


and D it is possible that 


(i) AB# BA 
(ii) CD = 0 but C $ Oand D ¥ 0. (Of course, this situation never occurs for 
real numbers.) 


. Find the inverse of each of the following matrices or explain why there is no 


inverse 


62 
(a) a=[74 


(b) B= E ‘ 


21 
631 
(c:) C=] 427 
1058 
101 
(d) D=1}110 
011 
2 30 
(ce) E=| 7 89 
—4-6 1 


. By first computing the minors of the matrix A of Exercise 83, find A7!. 
. (a) If A and B are both not invertible square matrices, show that A + B is not 


invertible or give a counterexample to this statement in the case of n = 2 
or 3. 

(b) If A and B are both invertible square matrices, show that A+ B is invertible 
or give a counterexample to this statement in the case of n = 2 or 3. 

(c) If C is an invertible matrix and A € R is not zero, show that AC is invertible 
or give a counterexample to this statement in the case of n = 2 or 3. 


. Let A and B ben xn matrices. 


(a) Prove or disprove det(A + B) = det A + det B in the case when (i) n = 2 
Gi) n = 3 
(b) Prove or disprove that for A € R, 
det(AA) = A” det A 


in the case where (i) n = 2 (ii) n = 3 
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—-1 0 8 —-1 4 8 
9. LetA = 2 2 0|,B= 1 —4 -—6 
—-1l 0-4 0 3 1 


(a) Determine the determinants of the matrices A and B. 
(b) Determine the determinants of the matrices AB and BA. 
(c) Prove or Disprove: In the case of all 3 x 3 matrices, det(AB) = 
det(A) det(B). 
-1 
10. It is known that ; ; does not exist. What is the value of a? 
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Overview In this section, we will deal first with two linear equations in two 
unknowns describing the various solutions that are possible. Of course, the word 
linear means “line” and that gives us a geometric picture of the algebra that we 
will be doing. Throughout this section, we will interpret our results in terms of the 
geometry of straight lines in the plane. 

Let’s think about the possibilities for two lines in the plane. Geometrically, there 
are only three choices for the relative position of two lines in the plane. They are 
either identical, intersect at exactly one point, or don’t intersect at all. We can also 
see these results from the viewpoint of algebra. If we have two linear equations 
in two unknowns, there are also only three choices: the equations have identical 
solutions (the lines they define are actually the same line), have a unique point 
as a simultaneous solution (the lines have exactly one common point), or have no 
solution at all (the lines do not intersect). (See Table 6.1, page 164, for a summary 
of these interpretations). First, some definitions are needed. 


Definition. A STRAIGHT LINE in the plane is an equation of the form ax + by =c 
where a and b are not both zero. Alternatively, a straight line defined by the real 
numbers a, b, and c is given as a set 


L=({(x,y) | ax + by =c} 


where a and b are not both zero. 


From your school mathematics, it is common to think of a straight line as an 
equation of the form y = mx + k where m is the slope and k is constant (the usual 
letter b is being reserved for another constant). Geometrically, because the line goes 
through (0, k), k is called the y-INTERCEPT, or more precisely, the y-coordinate of 
the y-intercept. Since we will also need to talk about vertical lines, we will define a 
straight line as above. 
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Looking at the definition of a straight line above, the case in which b = 0 but 
a # Ois that of a vertical line. If a = 0 but b ¥ 0 then the line is horizontal. More 
generally, if b 4 O then the line L is not vertical and 


y = mx +k wherem = + is the slope and k = = 
a 


Instead of looking at the situation geometrically, we now view it algebraically 
as a set of equations. It is important to note that it is not the equations themselves 
which matter, but the set of points that satisfy them. 


Definition. The SOLUTION SET for the m equations in 2 variables f\(x, y),..., 


tn(x, y) is 
{(x,y) €R? | AG y) = Ax. y) =-++= fax, y) = 0}. 


The system is LINEAR if each of the functions fi(x, y), fo(x, y),..-, fur(x.y) isa 
straight line. 


Many people make the common error of confusing an equation with its solution 
set. For example, x + y = 0 and 2x + 2y = 0 are not the same equation but they 
have the same solution set {(x, y) | y = —x}, the line through the origin with slope 
—1. We can double the first equation to get the second which changes the first but 
does not change its solution set. The idea of changing equations without changing 
the solution set is very important and we’ll see it algebraically, geometrically, and 
through matrix operations 

The condition that “not both a and b are zero” is said more simply in 
mathematical notation as a* + b? # 0. This will come in handy when we are 
involved in more than one equation. 

We will now work with two linear equations in two unknowns, show the 
geometry of the situation and how trying to find intersection points (a solution set) 
leads naturally to the algebraic concept of the determinant. 


Definition. We will consider the case of the two lines 


Li:aax+bhy=c ai +b; 40 
Ly:a2x + hoy = cy az +b3 40 (6.2.1) 


The system (6.2.1) which is determined by L, and L is called HOMOGENEOUS if 
C) = C2 = O and it is NON-HOMOGENEOUS otherwise. The matrix of coefficients 


of the system (6.2.1) is A = E by | 


az bz |* 
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Since a solution to system (6.2.1) means that a point P = (x, y) is on both lines 
L, and L, P is an intersection point of the lines L; and L>. Looking at things 
geometrically, there are three cases, which contain the definition of a system of 
equations as consistent, inconsistent, dependent and independent. 


Possibility 1. The lines L; and L» intersect. The only alternatives are that L; and 
L» may intersect in exactly one point or their solution set is the same straight 
line so that there are infinitely many points of intersection; that is, L; = Lo. In 
this case, the system (6.2.1) is called CONSISTENT. 

Possibility 2. The lines L; and L» do not intersect. The system of L; and L2 (6.2.1) 
will then have no solution and the system is called INCONSISTENT. 


Consider the possibility where the lines L; and L» intersect. In this case, the 
system (6.2.1) is said to be INDEPENDENT if it has exactly one solution or no 
solutions. If the system is not independent, it is called DEPENDENT. We will see 
that if (6.2.1) is dependent then the first equation becomes a consequence of the 
second equation (or vice versa). Here is a summary of what we will prove in this 
section. 


Theorem 25. Consider the system of two equations in two unknowns given by 
Eq. (6.2.1). The only possibilities for solutions sets are given by Table 6.1. 


Proof. For convenience, we first let A to be the 2 x 2 matrix of coefficients of (6.2.1) 
with d its determinant: 


A= i al det A = ajbo — byan = d.. 
a2 by 


Going back to System (6.2.1), we multiply the first equation of (6.2.1) by b2 and the 
second by b, giving 


ayboyx + bybzy = cybz 
and 


aob\x + byby =). 


Table 6.1 Character of possible solutions to a linear system of two 
equations in two unknowns 


System System 

dependent/ consistent/ 
Case | # of solutions Solution set | independent | inconsitent 
1 None Empty Independent | Inconsistent 
2 1 Point Independent | Consistent 
3 Infinitely many | Line Dependent Consistent 
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Subtracting these two equations and recognizing that 
aib2 — byay = detA =d 
gives, 
dx = cyb2 — dycp. (6.2.2) 


Similarly, multiplying the first equation by ay and the second by a, and subtracting 
gives 


dy = a\C2 — aC} (6.2.3) 


Thus (x, y) is in the solution set of Eq. (6.2.1) exactly when (6.2.2) and (6.2.3) are 
satisfied. We now have three cases. 


Case 1. (d 4 0) By simply dividing (6.2.2) and (6.2.3) by d we obtain 


det cy by a C1 
= a) spe =e ifd =detA=de["] 40. (6.2.4) 


Case 2. If d = 0 and the numerators det . ie | and det] aos |; both equal 0 and 
then the system (6.2.1) has infinitely many solutions. 
Case 3. If d = 0, det] ee | # 0, and det] a fy] # 0, then the system (6.2.1) has 


. co bo 
no solutions. 
Together, these three cases give the Table 6.1. a 


Corollary 9 (Cramer’s Rule). Consider the system of the two equations in two 


ay by 


unknowns (6.2.1) whose matrix of coefficients is A = B on 


| and whose constants 
C1,Cy are on the righthand side of the equations. If d = det A # 0 then there is a 


unique solution to the system and the values for (x, y) are 


det] | det[ 2 2] 
x= —F* and y = aa ae (6.2.5) 


Thus, the condition that det A # 0 means that the solution set of the system (6.2.1) 
consists of a single point whose coordinates are given by (6.2.5). 
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Note that if the homogeneous system 


ayx+by=0 
ax+hy =0 


has non-trivial solutions (that is, solutions other than x = y = 0) then 


d = det i a = 0. 
a2 bo 


Note also that the Case 1, Case 2 and Case 3 can be simplified as follows. Let 


ax+thy=c, 
aX + hy = cp. 
Then 
Case 1. If AG 3 , then it has one solution. 
Case 2. If a a =H = ¢> then it has infinitely many solutions. 
Case 3. If ai = a x a then it has no solution. 


We will use Cramer’s Rule for 1 x n matrices in Sect.6.4 when n/ > 3. 


Example 125. For each of the nonhomogeneous systems (a), (b), (c), what are the 
solutions? 


(a) System (a) is 


2x +5y =2 
x+y =3 
Since ¢ x 2, it has one solution. 
(b) System (b) is 
2x+4y=7 
x+2y =6 


Since 2 = : x Z it has no solution. 


(c) System (c) is 


x—3y=5 
3x —4y = 15 


Since 7 = 3 = % , it has infinitely many solutions. 
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Example 126. Find the solution set for 


3x -—8y =5 

4x+ 9y = 11. 
Solution. The solution set uses Cramer’s Rule. Since det | 3 | = 59, 
det [ 7 =| = 133, and det [ 3 fal = 13, Eq.(6.2.5) shows that x = Be and 
y= zB. The solution set is the one point (S, a). // 


We now give some examples of systems of equations which do not look like they 
are problems about linear systems. It is a transformation which allows us to solve 
the problems using linear systems. 


Example 127. Find all solutions of the system of equations 
3x + y = 6xy and2y —x = Oxy (6.2.6) 


Solution. We are being asked to find the solution set of (6.2.6). Although it doesn’t 
look like a linear system, we can use the principles already established to solve it. 
Firstly, it is obvious that (0, 0) is a solution. Furthermore, if either x = 0 or y = 0, 
then so must be the other one. Thus we are looking for solutions in which neither x 
nor y is zero. We will use a transformation whose variables are now X and Y with 


x #Oandy 40. 
(6.2.7) 
and Eq. (6.2.6) becomes 


X+3Y =6 
2X -Y=9, 


(6.2.8) 


Equation (6.2.4) gives X = 2 and Y = 3 as a solution to (6.2.8). Since X¥ = + 
and Y = oi the solutions set for (6.2.6) is 
{(0,0), (33, 3)}- 
// 


Note that examining the case where x or y is zero allows us to make the 
transformation (or change of variables) given by Eq. (6.2.7). 
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Example 128. Solve the system 


and (6.2.9) 


4 3 
x-3 2y-1 


=1. 


Solution. Since x #4 3 and y # 5 (because Eq. (6.2.9) will make no sense), we 


may make the transformations X = + and Y = sy: Then we have a system of 


equations in our new variables X and Y which is 


3X +4Y =7 
4X —3Y =1. 


(6.2.10) 


Now, d, which is the determinant of the coefficient matrix is given by 


3 4 
det = —25. 


So Corollary 9 shows that 


det ie 
1-3 —21-4 
X= = =] 
—25 —25 
and 
det a7 
41 —25 
Y SSS — S| 1 
—25 —25 
We have now solved the system (6.2.10) and found X and Y. However we were 
asked to solve system (6.2.9) for x and y. Since 1 = X¥ = — we see xX = 
4. Similarly, 1 = Y = Zot so that y = 1 and (4, 1) is the unique solution to 
Example 128. // 


Example 129. Let a € R. Discuss the characteristics of the solution set in terms of 
a of the system 


ax —6y = 5a—3 
2x + (a—7T)y = 29—-Ta. 
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Solution. Here, we are asked how the solution set varies as a does. To use 
Theorem 24 (and Corollary 9), we need to compute three determinants. The 
coefficient matrix of the system is 


Thus 
a —6 
d = det =a’—Ta+12 
eto anal ’ a? 
5Sa-—3 —-6 a 
= _ 19 
tet twa t| 5a~ — 80a + 195 
and 


a 5a—3 2 
det =— ; 
e | Ta’ + 19a+6 


Putting these last three equations into (6.2.2) and (6.2.3) shows that, for a solution 
(x, y) to exist, 


(a? —7a + 12)x = 5a? — 80a + 195 
and 

(a* —7a + 12)y = —7a”7 + 194+ 6. 
Factoring each side of the last two equations shows that x must satisfy 

(a — 3)(a — 4)x = 5(a — 3)(a — 13) (6.2.11) 
and y must satisfy 

(a —3)(a -—4)y = —(Ja + 2)(a — 3). (6.2.12) 

If a = 3 then (6.2.11) and (6.2.12) both reduce to 0 = 0 and there is an infinite 

solution set (given by either 3x — 6y = 12 or, what is the same 2x — 4y = 8). 


If a ¥ 3 then (6.2.11) and (6.2.12) become 


(a — 4)x = 5(a — 13) and (a —4)y = —(Ja + 2). (6.2.13) 
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Table 6.2 Solutions to 


Powanle tee alist _|-peluonnet — 
nae a=3 {(x, y) | 2x —4y = 8} 
a=4 Empty 


5@—13) _ Ta2 
a#3,a#4 | (=>, -ee)h 


If a = 4 then (6.2.13) shows that there is no solution. If a 4 4, then we can solve 
(6.2.13) directly and come up with 


= Ta+2 
x = ——— andy =—- 
—4 a—4 


as the unique solution. We collect these solutions in the table above (Table 6.2). 
// 


Problem Set 6.2: 


For each of the Problems | through 4, give all solutions or state that there are 
no solutions. State whether the system is dependant, independent, consistent or 
inconsistent. Justify your answers. 


1. 
6x -—4y =4 
—3x+2y=a7 
2 
10x —6y = 8 
—35x — 15y = 20 
3. 
4y+z=6 
—yt2z=9 
4. 
x+3z=8 


4x-z=9 


6.2 Systems of Linear Equations in the Plane 
5. Let x and z be real numbers. Find all solutions to the equations 
4x —z=2xz 
and 
2z—2x = 7xz. 
6. Are there real numbers w and f with 
3t + 7w = 2wt and 2t + 4w = 7wt? 
7. Solve if possible for w and f. 
(Qw—-1)+t=2t(w-1) 
and 
3w + 3t = 4t(w— 1). 
8. Solve for x and y, if possible. 


2 3 


Sy a ae 

x+5 y-7 

and 

3 2 
aoe 2x6 

x+5 y-7 

9. Solve for x and y if possible. 

2% =5-274+7 


3-2%*+4-2% =2 
10. Solve for x and y, if possible. 


e~+2e7 =4 
—3e* + de? =5 
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11. Solve for x and y, if possible. 


ert? —e’ =3 


3e* + 4e7 = 4 

12. Solve for x and y if possible. 
ex 4 2rtla7 
e*—2 =5 


13. Leta € R. Discuss the characteristic of the solution set of the system. 


(a —3)x —2y =2a 
3x + (2a+ 1)y =-a-2 


14. Solve for x and y, given real numbers a and b. 


x+ay+a’?=0 
x+by+b?=0 


6.3 Systems of Linear Equations in Euclidean n-Space 


Overview The many similar computations and row operations in the first two 
sections of Chap. 6 lead us to consider if there is a natural connection between the 
material. In this section, we show how operations on matrices can be interpreted 
in terms of solutions sets to a system of linear equations. Cramer’s Rule (Sect. 6.4) 
follows easily from the use of matrices. We will go on to use the ideas of Sect. 6.5 
to simplify computations such as finding the GCD of two integers. 

We will continue to use the notation of Eq. (6.2.1) by starting with two lines L; 
and L2, which are each given by two equations 


Ly:aqyx+byy =c, a? +b? 40 


Lo: ayx + boy = cy a, +b; £0 (6.3.1) 


and ask for the solutions set to these equations but this time matrices are used. Once 
again, we write 


A= EB ‘| andd = det A 
a2 02 
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for the matrix of coefficients of (6.3.1). Let C = | Then the system (6.3.1) 
c2 


becomes 
A | =f. (6.3.2) 


What is the inverse of A? In order for that to happen d = det A ¥ 0. In that case, 
Eq. (6.1.4) shows that 


1) _ 1[ by -b 
A =3| |: (6.3.3) 


—a2 ay 


Proposition 25. [f the matrix of coefficients of the system (6.3.1) has a non-zero 


. : P : x 
determinant then there is a unique solution to (6.3.1) such that 
y 


Jere 


det{ © 


c2 bo 


a\ 4a) 41 bd 
det] iis Bo 


and 


x= (6.3.4) 


det[ 3] es 
[ 


Sie (6.3.5) 


a 4. a1 by) 
det mn 


| 
| 
| 
| 


Proof. Let X = H be the 2 x | matrix of unknowns. Since AX = C and A has 
y 


an inverse, A~'AX = A7!C so that H = A7'C. Now we will actually compute 
y 
x and y using (6.3.3). 


_ 1 cib2 — bi co 
~ d a,C2 — A2C, ; 
The entry cjb2 — bc is 


det[ 1] 


c2 by 
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whereas the entry c,b2 — b,c is 


det[ # *], 


az by 


which results in (6.3.4) and (6.3.5). a 


Notice once again, when the appropriate machinery is derived, how easy some 
results appear. The solution (when the matrix of coefficients does not have zero 
determinant) to the system (6.2.1) is given by Eqs. (6.3.4) and (6.3.5). 

Is there an easy way to remember the solution of the linear system AX = C if 
d = det(A) # 0? Yes, there is! Examining the conclusion of Proposition 25 shows 
that 


= boc — bycr 
ad 
cy by 
(69) by 
of A (the “x column” if you like) and replacing it with the coefficient column C = 
| . The solution for y is the det EB a 
C2 a2 C2 
found by replacing the “y column” by the coefficient matrix C. Thus, the way to 
remember Eq. (6.3.4) is to replace the x-row in A with the constants of the original 
equation, compute the determinant and divide by d = det(A). A similar statement 
holds for Eq. (6.3.5). 

The use of matrices also allows us to handle systems of m equations in n 
unknowns. This is really the subject of a semester course in linear algebra and will 
not be treated in full generality here. 

However, we will now look at the situation in which the number of equations is 
the same as the number of variables, ultimately specializing to the case of n = 3 
dimensions (or a system of 3 linear equations in 3 variables). 


is the same as the det I The matrix is obtained by taking out the first column 


/d which is the determinant of the matrix 


Definition. A LINEAR EQUATION inn variables x,,...,X, is an equation of the 
form 


AX, + A2X2 Het FAnXn = C 


where d1,...,4, are real numbers, not all zero and c € R. 
A LINEAR SYSTEM of n equations in n unknowns is a collection of 7 linear 
equations in n variables. 


There are three points to note. The first is that if a linear system of 1 equations in 
n unknowns is given then there are n* real numbers a; ; (called the COEFFICIENTS 
OF THE SYSTEM) and n real numbers c,...,¢,, such that 
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Ay X1 + j2X2 +++ + ainXy = Ch 


A21X1 + A22X2 + +++ + A2nXn = C2 
(6.3.6) 


Ani X1 + An2X2 +++ + AnnXn = Cn 


and for each i, Yi =0 aj, # 0 (this is the statement of the technical point that, for a 
fixed 7, not all a;; are zero). 
The second is the 3 x 3 case. When there are 3 equations in 3 unknowns, (6.3.6) 
is written (usually with x; = x, x. = y, and x3 = z) as 
aX + a2y + 4132 = C1 
a2X + avy + a23Z = C2 (6.3.7) 
a31X + 432V + 433Z = C3. 
Equation (6.3.7) should be familiar to you from Algebra II when you were solving 
3 linear equations in 3 unknowns. 
The third point is that the number of equations and the number of unknowns is 


the same, n. Solving for 2 equations in 3 unknowns (or 3 equations is 2 unknowns) 
gives quite different alternatives than 3 equations in 3 unknowns. 


Proposition 26. The system of linear equations (6.3.6) has a unique solution if the 
determinant of the matrix of coefficients of the system is not zero. 


Proof. Let A be the matrix of coefficients of the system, so that 
1 412 *** Ain 
421 422 *** Gn 
i 
Qn1 Gn2 *** Ann 


and let C be the n x 1 matrix given by the right hand side of the system (6.3.6) 


If we write the unknowns as ann x | matrix, X, where 


x} 
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then (6.3.6) becomes 
AX =C. (6.3.8) 


Since det A # 0, there is an inverse to A and multiplying the last equation by A7! 
yields A~'AX = A7!C or 


X=A'C, 


Of course, we could substitute X = A~'C into Eq. (6.3.8) and see that X satisfies 
that equation. 

We have, from Eq. (6.1.4) a precise form for the inverse of a matrix, so that, if 
cj,; is the (7, 7) cofactor of A, then 


T 
Ci C12 +++ C13 
1 C21 C22 +++ €23 C 
det A , 
C31 C32 ++ C33 
In the 3 x 3 case 
T 
x 1 C11 C12 C13 Cl 
X= Vi =>. 1 | ©6201 ©o2 G3 c2|- (6.3.9) 
det A 
4 C31 €32 C33 C3 


We now consider ELEMENTARY ROW OPERATIONS on matrices which will be 
very useful in the next section. The key point is that performing an elementary row 
operation on a matrix does not change the solution set to the corresponding linear 
system (Theorem 26). 


Definition. Let B be an m x n matrix. An ELEMENTARY ROW OPERATION on B 
is one of 


(i) Two rows of B are interchanged 
(ii) Multiply the ith row by a real number c ¥ 0. 
(iii) To change row i, row number 7 is multiplied by a real number, A, then added 
to row i and placed in row i. This means that R; + AR; — R;. The other rows 
of the original matrix B stay the same. 


Example 130. 


(a) If B = : : ‘| then what is the result of multiplying the first row of B by 


(—2) and adding it to row 2? 
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(b) If the elementary row operation, interchange the second and third rows of 


23 54 
D= 31 72], 
—33-47 


what is the resulting matrix? 


Solution. The elementary row operation adds (—4 — 6 — 2) to the second row but 
leaves the first row unchanged. Thus 


231 
B— fe 0 4 : (6.3.10) 


For the second part, interchanging the second and third rows gives 


23 54 
P| H99247). (6.3.11) 
ie oo 


// 


For motivation as to why we use elementary row operations, let’s look at the 
linear systems that B or D represent and see what happens to the solution set under 
an elementary row operation. 

Consider the linear system 


2x+3y=1 
(6.3.12) 
4x + 6y =3 


The coefficient matrix of the system is li ‘ and the two constants ; | give the 


matrix B of Example 130(a): 


Multiplying the first equation in the system (6.3.12) by (—2) and adding it to the 
second equation of (6.3.12) gives (repeating the first equation) 


2x+3y=1 
(6.3.13) 
Ox + Oy = 1. 
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This last system is represented by 


231 
ie | (6.3.14) 


but we’ve not changed the solution of (6.3.12) because we have performed simple 
algebra (multiplying one equation by (—2) and adding it to the other). 


Exercise 84. What system does the matrix D of Example 130 represent? What 
system does the matrix D after the elementary row operations described in the 
example represent? Do the two systems have the same solutions set? What does 
the second row mean in Eq. (6.3.14)? 


Remark. We have just observed that elementary row operations allow us to manip- 
ulate a system of represents them without changing the solution set of the original 
equation. 


Thus we have the following theorem. 


Theorem 26. /f S is a linear system represented by the matrix B and e(B) is the 
matrix which comes from applying an elementary row operation to B, then B and 
e(B) are matrices whose systems have the same solution set. 


Example 131. Find the solutions of the system of Eq.(6.3.12) [see Exam- 
ple 130(a)]. 


23\1 


Solution. Equation (6.3.12) is represented by B = E 613 


23\1 
00'1]- 


But e(B) represents the solutions to the equation 


and it has the same 


solution set as in (6.3.14) 


2x +3y = 1 
Ox +0y=1 or O= 1. 
Thus, there are no solutions the system of (6.3.12). // 
Example 132. Solve the linear system 
y—3z=4 
2x + 3y—-z=6 
4x + 5y + 3z=8 


by using elementary row operations. 
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Solution. 


01-3,4 
23-16|— 
45 38 


23-16 
01-314 interchange row | and row 2 
45 38 


2 3-1, 6 
0 1-3: 4 Add —2 times row 1 to row 3 
0-1 5'-4 


Add row 2 to row 3 


Add row 3 to row | 
00 10 


230,6 
01014 Add 3 times row 3 to row 2, this shows y = 4 
001°0 


200-6 
010: 4 Add —3 times row 2 to row 1 
001 0 


23-16 1 
> |01-314 Multiply row 3 by 5 This shows z = 0 


100,-3 1 
>|]010: 4 Multiply row | by a This shows x = —3 
001 0 


Hence we have found the solutions x = —3, y = 4,z = 0. Note the above process 
is called Gaussian elimination, after the German mathematician C. F. Gauss (1777-— 


1855). 
Problem Set 6.3: 


1 4 3 
1. LetA = 2=3 3). 
—2 3 1 


(a) What is the resulting matrix if you multiply the first row of A by 3 and then 


add it to row 2? 
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(b) What is the resulting matrix if you interchange the second and third rows 
of A? 

(c) What is the resulting matrix if you perform both operations from Parts (a) 
and (b)? Does the final matrix depend on which order you perform the 
operations? 


. Find solutions to the system of equations below using determinants: 


2x +4y = 12 
(6.3.15) 
—-x+5y =9 
. Find solutions to the system of equations below using determinants: 
x+y+z=3 
2x + 3y-z=-4 (6.3.16) 


—4x —2y + 3z=6 


. Can you perform elementary row operations on the matrices below and end 


up with the matrix J. If so, list the necessary row operations and show each 
resulting matrix after performing the operation. If it is impossible, explain why. 


1 00 
(a) |4 2 Of. 
0-3 1 
1—2 0 
(b) | 0-2 O}. 
0-2 1 
110 
(c) }003 
001 
ae ee ~2/7 —5/14 11/7 
(d) |0—2 4]. | Answer: | 2/7 5/14 —4/7 
1-1 4 -1/7 3/7 -2/7 


. Determine if the below augmented matrices represent systems with one, 


infinitely many, or no solutions first by solving for the determinant and then 
again using Gaussian elimination but do not actually solve the system. In each 
case, determine which method was simpler and why. 


A 3 3)2 
“1411/3 


| 216 
ll. 
& an 
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vex of =9> 3] 1 
iii. 
10 —6|—2 


24 12) 18 
iv. | 06-9) 18 
00 17\204 


6. Solve the following systems of equations by Gaussian elimination. 
i. 


x+3y+z=4 
2x + 8y —2z=2 
3x + 9y —9z = 16 


2x + 4y —7z=6 
x —3y—-—2z7=2 
6x + 12y —21z = 24 


ili. 
x+3z=6 


6y —2z = 2 
x—12y+7z= 16 


6.4 System of Linear Equations: Cramer’s Rule 


Overview The Cramer’s Rule in Corollary 9 can be generalized easily to a system 
of n linear equations in 7 variables. In Sect. 6.4, we will use n = 3 throughout and 
follow the use of matrices. 


Theorem 27. Consider system of 3 linear equations in 3 unknowns 
aX + ayy + 4432 = by, (6.4.1) 
aX + a22y + a3Z = bo, (6.4.2) 


a3)X + a32y + a33z = D3. (6.4.3) 
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We write 
Q\1 G12 413 
A => a21 A22 A23 and A => det A => |A| 
431 432 433 
by ay 443 ay by ay3 a1 412 by 
Ay = |bo a2 473), Ay = Jao, by ay3), Az = Jaz a2 bo). 
b3 32 433 431 b3 a33 431 432 b3 
If A # 0, then 
A, Ay A, 
x= y= i 3 
A A? A 


Proof. We know that Eqs. (6.4.1), (6.4.2), and (6.4.3) can be written as 


ay G12 A143 x by 
a21 422 423} =| VJ] = by 
31 432 433 z b3 


Theorem 22(iii) on page 159 uses the (i, 7) cofactors of A whose notation is c;; for 
1 <i <3and1 <j <3. Thus we have 


by XxX 

AX = | bo where X = | y 

b3 v4 

The Eq. (6.1.4) shows that 
T 

x [cue es by y | cu C2 31 by 
X= Yi) = WI C21 €22 €23 bo | = — | €12 €22 €32 by 
[A| | i 
z C31 €32 C33 3 C13 C23: C33 3 


Therefore, with A and | A| being the same notation for det A, 


1 1 
x = —(bi ey + bye + b30¢31) = —Ay, 


| A| |A| 
y= ais + bye + 63032) = i, 
| A| |A|~ 
L= ee + bye23 + b3¢33) = Ae, 
| A| | A| 
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Example 133. Use Cramer’s Rule to solve 
71x -y—-z=0, 
10x -—2y+z= 8, 
6x + 3y —2z=7. 
Solution. The determinants are 
7-1-1 0-1-1 
A= ]10-2 1)=-61, A,=j/8-2 1/=-6l, 
6 3-2 7 3-2 
70-1 7-10 
Ay =]108 1) =—183, A, = |10 —2 8} = —244. 
67-2 6 37 
seca fe Nn eas A, _ = 
Thusx = 48 = == Ly= f= 2 a3,2= $= Ha, // 
Exercise 85. Use Cramer’s Rule to solve 
4x + 3y + 3z = 3, 
2x —4y +4z=7, 
3x + 2y + 6z = 3. 
(Answer: x = 31 y= —3,z = a 
Example 134. Use Cramer’s Rule to solve the system of equations in x, y and z. 
3 6 1 
+ + =1, 
ytz z+x x+y 
6 4 1 
+ a0 = 5 
yz zZt+x x+y 
15 2 3 
+ aD a 
yz zZt+x x+y 
Solution. Let X = Me ot Z= . We now have three linear equations 


in three unknowns oa Y Aad Zi. 


3X +6Y+Z=1, 
6X + 4Y —-Z =3, 
15X + 2Y —3Z =6. 


198 6 Matrices and Systems of Linear Equations 


By using Cramer’s Rule, we obtain 


1 1 
X=-, Y=-, Z=-1. 
6 4 
Thus ss — i, - - — i +“ = —1. Therefore, we have three linear equations in 
x,y and z. ; 
y+z=6, 
z+x=4, 
x+y=-l. 
Solving this system, we have x = —3, y= s, Z= ql // 


Exercise 86. By using Cramer’s Rule, solve 


1 1 1 
—~+-—+-=0, 
HE Wo OZ 


Sas ee 
—~4+—=4+==5, 
y 


cad 
N 


(Answer: x = 1/2, y = 1,z = —-1/3.) 
Example 135. Yt is known that the following system has a unique solution, find the 
value of a. 


ax+y+z=1, 
x+tay+z=1, 
x+tytaz=l. 
Solution. Since 
all 
A=|lalj/=a4+1+1-a-a-a=a -3a+2=(a-1° (+2), 
lla 


we must have a # 1, a # —2. Thus, all values of a except a = | anda = 2 havea 


unique solution. // 
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Exercise 87. It is known that the following system has solution x = 0, y = 0, 
z = 0 for any value ofa € R. 
—x+y+z=ax, 
xX-yt+z= ay, 
X+y-Z= az. 
Find the values of a so that the system has no other solution. (Answer: a # 1 or 
a#2.) 


Example 136. Let 3 positive integers be given whose sum is 51. If the first positive 
integer is divided by the second one, then the quotient is 2 and the remainder is 5. If 
the second positive integer is divided by the third one, then the quotient is 3 and the 
remainder is 2. Find the value of these three integers. 


Solution. Let x, y,z be the given positive integers. Then 


x+y+z=5l, 
x=2y+5, 
y =3z2+2. 
This system is equivalent to 
x+y+z=5l, 
x—2y =5, 
y—3z=2. 


Using the same notation as Theorem 27 


id 4 
A=|1-2 0|/=6+1+3=10, 
Od = 
511 1 
A, =|5 -2 0| =51()+54+4+4 15 = 330, 
S 1-23 
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151 1 
Ay=|1 5 0|=-15+4+2+ 153 = 140, 
0 2 -3 
1 1 51 
A, = |1-2 5| = 40, 
01 2 
We havex = 42 =33,y= = 14,2= & = 4, i 


Exercise 88. If person A, B, C work together for one job, they needs 10 days to 
finish it. If B and C work together, then they can finish it in 15 days. If A works for 
15 days, then C needs 30 days to finish it (B is not working). How many days will 
be needed if A, B, C works alone? (Answer: x = 30, y = 20, z = 60.) 


Example 137. Leta, b,c be the lengths of a triangle, and let 
x+y+z=0, 
ax + by +cz=0, 
axt+b’y+c?z=0 
have only one solution (0,0,0). Then what kind of equations can this triangle be 
described by (if at all?) 


Solution. Since 


1 
c| = (a—b)(b—c)(c—a) £0, 


2 2 


[oe 


1 
A=l|a 


a 


> 


we havea # b F c. Hence the triangle is not equilateral triangle nor isosceles 
triangle. Note: the matrix 


a c 
a? b? c? 
is called a Vandermonde matrix, named after A. T. Vandermonde (1735-1796). // 
Exercise 89. Compute the determinants of these Vandermonde matrices: 
1 1 1 15 25 


10 11 12] and |1 9 81 
100 121 144 1 12 144 
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Example 138 (Fitting for Quadratic Polynomial Curves). Let A = (—1,2), B = 
(1,5), and C = (2,3). Show that {A, B, C} forms a quadratic polynomial curve. 


Solution. Let p(x) = a + bx + cx? and hence, 
2= p(-1)=a-—b+c _ ford 
5=p(l)=a+b-+4+c_ forB 
3 = p(2)=a+2b+4c_ forC. 


The solution of this system of linear equations in the variables a, b, and c is 


14 3 
a= DS 5 ande = —<. 
Thus, the quadratic polynomial is p(x) = a + $x = ex". // 


Exercise 90. Find a quadratic polynomial curve that fits the points D = (—2,4), 
E = (1,5), and F = (3,7). 


Problem Set 6.4: 


For Problems | through 6, solve the given system of linear equations by Cramer’s 
rule. 


1. 
x+2y+z=6 
2x—4y+z=7 
3x + 3y + 5z = 10 
2. 
3x + 4y + 3z=0 
x+2y—-z=-l 
x+y+2z=1 
3; 
x+y—-z=0 


x-2y+z=-3 
4x + y—2z=-3 
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4. 
1 1 1 
—+-+4+-=0 
xX yp 2 
ae re 
—--+-=6 
x yp 2 
2 3 
—-+—-+-=-5 
y 2 
5. 
3 6 1 
+ ao =1 
yz zZt+x x+y 
6 4 1 
+ — = 
yz zZ+x x+y 
15 2 3 
+ _ —| 
yz zt+x x+y 
6. 


x+ty+z=1 
ax+by+cez=d 


a’x + b?y + c?z = a? 
7. Assume that 
x+3y+2z=—-3 
2x+y+3z=1 
x+tay+z=6 


has infinitely many solutions. Then what is the value of a. 
8. Assume that 


ax—y+2z=0 
x+y-z=0 
3x+y=0 


has infinitely many solutions. Then what is the value of a. 
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9. Assume that the following systems of linear equations have the same solution. 
What is the value of a, b,c? 


xty+z=6 ax —by +cz= 12 
3x-—y+2z=9 and ax + by +cz = 32 
3x —2y—5z=0 cx —by—az=0 


6.5 Applications of Matrix Operations to the GCD 


Overview For readers who have studied matrix algebra, there is a more procedural 
way to find the GCD(a, b) and a particular solution of the integral linear combina- 
tion. We will describe this matrix process through Example 139 on the next page 
which readdresses the case of Example 43 (page 54) through the use of matrices. 

To accomplish the matrix method for GCDs, we first write the two integers a,b 
as a column [4%] and then append the identity matrix. We write N = [/] and J for 
the identity which appears as 


A=[N I], 


a2 x 3 matrix. We can now use elementary row operations on A, where A has only 
integers as elements. These row operations are called INTEGRAL ELEMENTARY 
ROW OPERATIONS. We then add or subtract an integral multiple of one row from 
another until the (2,1) or (1, 1) entry is zero. The top row of the reduced matrix 
gives the results (or the bottom if the (1, 1) entry is zero). 

Here is a justification of this matrix approach. Note that the above elementary 
integral row operation on A is equivalent to multiplying on the left of A by a matrix 


F 4 or i "| for some integer k. So the sequence of this operation multiplies A 


on the left by a matrix B = ? with det B = sv — ut = 1, where s,t,u,v are 
uv 
integers. Therefore 
paa|>* alo] _ Ssat+tb st 
uv||b01 ua+vbuv 

Now if ua + vb = 0, then (ua’ + vb')g = 0 where g = GCD(a,b) and 
ua’ + vb’ = 0. Since GCD(a’, b’) = 1, we obtain u = db’, v = —da’, for some 
integer d. Now sv — ut = 1 = > s(—da’) —db't = 1 = > d(sa’ + tb’) = 
-1=$d=dHl,sa'+tbh' = Fl = sat+tb = g(sa' 4+ tb’) = +g. Similarly 


if sa + tb = 0, then ua + vb = +g. These matrices show that the GCD approach 
to matrices is a proof. (See also Silvester [11]). 
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Example 139. Find the GCD(611, 235) and a particular solution of the integral 
linear combination by matrix method. Compare the matrix method with the 
procedure in Example 43. 


Solution. As described in this example, we first write a 2 x 3 matrix augmented 
with the identity in the last two columns. 


= 611,10 
235'0 1 
Using (—2) times Row 2 and adding it to Row | we have 


B= 141,1 —2 
235'0 1 
Subtracting Row | of B from Row 2 of B gives 
141, 1-2 
ile 94 '-1 ‘| 
Subtracting Row 2 from Row | of C yields 


D= 47, 2-5 
94'-1 3 


From which we obtain 


E= 47, 2-5 
0'—5 13 
The GCD(611, 235) is the (1,1) entry of E and the coefficients of the integral 


combination of 611 and 235 are the entries in the (1,2) and (1,3) spots. More 
specifically: 


GCD(611, 235) = 47 and 47 = 2(611) — 5(235) 


as before. // 


Remark. The matrix method is quite useful because it both finds the GCD and 
simultaneously expresses the GCD as an integral linear combination. 


Exercise 91. Use the matrix method to find the GCD(24,116) and write 
GCD(24, 116) as an integer linear combination of 24 and 116. 


This matrix method can also be applied to find the particular solution of linear 
Diophantine equation of two variables. 
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Example 140. Find the particular solution of the integer solution of 16x+39y=10. 


Solution. First we set up the matrix as in Example 139 


16,1 0 
39'0 1] 


Add —2 times row 1 to row 2, we have 


16, 1 0 
7'-21]° 


Add —2 times row 2 to row 1, we have 


2, 5 =2 
7-2 1 4] 
Add —3 times row | to row 2, we have 
2, 5 -—2 
1-17 7 | 
Add —2 times row 2 to row 1, we have 
0, 39 —16 
1-17 7 | 
Multiply row 2 by 10, we obtain 
0, 39 —16 
10'-170 70 |” 
Thus the particular solution is x = —170 and y = 70. // 


The next step is to apply the matrix method to find the greatest common divisor 
of polynomials, d(x) = GCD(a(x), b(x)). It also is important to write d(x) as 
the linear combination of a(x) and b(x) using polynomials. Here are a number of 
examples. 


Example 141. Use the matrix method to find g(x) = GCD(x> + 6x? — 49x + 
42, x4 + x3 — 9x? — 3x + 18). 
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Solution. 


x? + 6x? — 49x + 42 10] Risn—x 
x4 + x3 — 9x? —3x + 18'01 


—x4 + 9x3 + 9x? — 67x +. 42)1 —x ] Ri=ntn 
4 Caney Por cae ars 
x" +x 9x 3x+18 '0 1 


10x? —70x +60 j1l—x +1] R=10r—-xn 
—— 
x4 + x3 —9x?-3x+18'0 1 


10x? — 70x + 60 1 —x+1 R2=r2—-1| 
10x? — 20x? — 90x + 180'—x x7 -—x+ 10 


10x? — 70x + 60 1 —x +1] R=2n+xr 
————E 
—20x? — 20x + 120'-x —1 x2 +9 


be — 20x + 120,—x? —x +2x7+7x + *] Ri=r—-1 


—20x? — 20x +120! -—x-1 x7 49 Ro=—r 
0 —x? +3 x3 —x2+7x —7 
20x? + 20x — 120! x +1 —x?—9 


So g(x) = 20x? + 20x — 120 and (x + 1)(x> + 6x? — 49x + 42) + (—x? — 9) (x4 + 
x? — 9x? — 3x + 18) = 20x? + 20x — 120. // 


Example 142. Use the matrix method to find g(x) = GCD(x* + 3x — 2,2x? + 
x? +x +6). 


Solution. 


x+43x—-—2 (10] Ri=2r-xr 
3 2 ars 
2x? +x°+x4+6'01 


—x3—x2-—4 2-—x] R=2rn+n 
2 ho ee 6108 I 


—x3—x?-—4,2 —Xx Ri=r1—xr2 
F ; > 
—x°+x—-—2'!4—-2x+1 


—2x? + 2x —4,-4x +2 2x? —2x ] Rian 
—x?+x-2 4 —2x+1 


Ro=-r2 
x? —x 4+2)2x—1—x? +x] R=n-n 

2 Z 

x —-x+2' —4 2x-1 


0 2x+3-—x2-x-1 
x?>—x+2! -4 2x —1 


6.5 Applications of Matrix Operations to the GCD 207 
Then g(x) = x* —x + 2 and (—4)(x* + 3x —2) + (2x —1)(2x3 +.x7+x+4+6) = 
x7 —x 42. // 


Example 143. Use the matrix method to find g(x) = GCD(x* + x? + 2x7 +x 4+ 
1,x?-—1). 


Solution. 


xt tx + 2x2? +4 41,10] Ri=n—xr 
x] 01 


x3 + 2x? +2x+1)1—x] Ri=n-r 
rs 
xe —1 ‘0 1 


2x? + 2x +2)1 —x—1] R=2r-xr} 
xe] 0 1 


2x7 +2x+2,1 -x-1 R=ntn 
as 
—2x? —2x —2!—x x? +x42 
2x7 +2x+2, 1 -x-1 
0 —x+1x7+4+1 
So g(x) = 2x? + 2x +2 and (1)(x* + x3 + 2x7 +x%4+1)4+ (—x-DO3-1D= 
2x? +2x +2. // 


Exercise 92. Write d(x) = GCD(a(x),b(x)) = p(x)a(x) + q(x)b(x) where 
a(x) = 7x3 + 6x? — 8x + 4 and b(x) = x3 + x —2. 


The matrix method can also be used for solving ax + by = c. Theorem 9 
(page 64) gives a necessary and sufficient condition for a solution to exist. 


Example 144. Solve 172x + 20y = 1000. 
Solution. Since GCD(172, 20) = 4| 1000, it has solutions. 


172,10] R=n-8r [12,1 -1] R=n—n 
ca a 
Ee a ae 1 
12, 1 —8 | Ri=n-r2 4,2 -17 | R2=r2—-2r 4,2 -17 
—— ————— 
E '_] 9 ie 9 bee 43 
So 4 = (2)(172) + (-17)(20) > 1000 = 250(4) = 250((2)(172) + (—17)(20)) = 


500(172) — 4250(20). So one solution is x) = 500 and yy) = —4250. Therefore, by 
Theorem 26, all solutions are of the form 


20 172 
x = 500+ (7) t=500+5¢, y= —4250-— (Fs = —4250 — 43. 


// 
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Exercise 93. Determine all integer solutions to the following Diophantine equa- 
tions by matrix methods. 


(a) 56x + 72y = 40 
(b) 24x + 138y = 18 
(c) 221x —354y = 11 


Problem Set 6.5: 


1. Using the Matrix Method of Example 139, find the greatest common divisor of 
the two given integers. Then write the gcd as a linear combination of the two 
integers. 


(a) 260 and 90 
(b) 1785 and 340 
(c) —616 and 286 


2. Determine the gcd for each pair of numbers from Problem | using the Euclidean 
Algorithm from Chap. 2. 

3. Using your work from Problem 2, determine a solution to the integral linear 
combination problem for each pair of numbers from Problem | using Tableau 
method. 

4. Given integers 297 and 693, 


(a) Determine GCD(297, 693) using the Euclidean Algorithm and then solve 
the integral linear combination problem using back substitution. 

(b) Determine GCD(297, 693) and solve the integral linear combination prob- 
lem using the Matrix Method. 

(c) Write a paragraph describing how these procedures work and the corre- 
spondence between the repeated division in the Euclidean Algorithm and 
the row operations in the Matrix Method. 


5. Find d = GCD(a,b) and integer x and y such that d = ax + by by using 
matrix method 


(a) a=52,b= 96, 
(b) a= 15,b = 50, 
(c) a = 5312,b = 2046. 
6. Find the particular solution of 
(a) 7x + 15y = 51, 
(b) 16x + 33y = 100, 
(c) 101x + 99y = 437. 
7. Write d(x) = GCD(a(x), b(x)) = p(x)a(x) + g(x)b(x). 
a(x) = x* + 3x —2 and b(x) = 2x7 4+ x7 +x4+6. 
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6.6 Evaluations of Determinants of 3 x 3 Matrices 


Overview Section 6.6 is a review for some routine results (such as Theo- 
rems 27, 28, and 29) which you’ve seen already. We provide an effective way 
to calculate the 3 by 3 determinant. 


Theorem 28. Let 
ay by cy 
A= a2 by C2 
a3 b3 C3 
If B is obtained by multiplying a row by a constant k, then det B = k det A. 
Proof. Let 
a by 


B= kar kbo Keo ; 
a3 b3 C3 


then 


aj by Cl 
det B = |kay kby kcen| = —kay 
a3 b; C3 


a, by cy 
=k a2 by C2|- 
a3 b3 C3 


Similar calculation demonstrate the result when a multiplication is applied to 
another row. a 
Example 145. Note that 

V2 V2 V2 111 


2 1 5 |=v6l21 5 
V3 V3 —-V3 eee 
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Example 146. Note that 


28 16 14 8 
3 24 30) = 30]1 8 10 
5 15 —5 13-1 
Theorem 29. Let 
a\ by Cl 
A= a2 by C2 
a3 b3 C3 


If B is obtained from A by adding a multiplication of a row of A to another row of 
A, then det B = det A. The same is true when interchanging columns. 


Proof. Let 
at by Cl 
B= )|a.+kaz3b.+kb3 o.+ke3 |, 
a3 b3 C3 
then 
aq by cl 
det B = jay + ka3 by +kb3 co + ke 
a3 b3 C3 
= by ¢1 ai cy 1 by 
= —-(a. + ka3) + (b2 + kb3) — (cz + ke3) 
b3 c3 a3 ¢ 3 b3 
by c ay C1 a, by by c aq Cy a, by 
=|[_- oa ee 
Gane a3 ¢3 afl) 4 ( Ibs a3 3 eal 
a, by cy 


az by c2| + k(—a3(b1¢3 — b3c1) + b3(aic3 — 431) — €3(a1b3 — a3b1)) 
a3 b3 63 


= det A 


Example 147. 
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Solution. Adding —1 times the first row to the third row // 


Remark. The above theorem holds for determinants of all m x n matrices. Indeed, 
corresponding results also hold for columns and elementary column operations in 
all determinants because of Theorem 29. Note that elementary column operations 
are exactly analogous to elementary row operations. 


The following three theorems are concerned with the elementary row operations 
and determinants of 3 x 3 matrices. 


Theorem 30. Let A and B be 3 x 3 matrices. If B is obtained from A by 


interchanging two rows, then det B = — det A. 
Proof. Let 
a\ by Cl 
A= a2 bo C2 
a3 b3 C3 


Without loss of generality, we may assume 


a2 by C2 
B= ay b, Cl 
a3 b3 C3 


Then by definition, 


det A = aybrc3 + anb3c, + a3b\c2 = a3brc1 = a, b3c2 = arb, c3 


det B = ayhic3 + a, b3c2 + a3b2c1, — a3b C2 — arb3c, — aybrc3 


Comparing det A and det B, we obtain det B = — det A. (il 
Example 148. 
(a). If Row | and Row 2 are interchanged, then the determinant of one matrix is 


the negative of the other matrix. 


1/41/2 1 1 05 
1 0 5/=-|1/41/2 1 
5 3 -4 5 3 -4 
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(b). If Row 2 and Row 3 are interchanged, then the determinant of one matrix is 
the negative of the other matrix. 


1/41/2 1 10 5 
10 5/=|5 3 -4 
5 3 —4| |1/41/2 1 


Example 149. 
1/41/2 1 1/21/4 1 
1 O 5;/=-] 0 1 5} (Column 1 and Column 2 are interchanged) 
5 3 —-4 3 5 -4 


Problem Set 6.6: 


By using the elementary row operations and the properties of determinants, solve 
the following problems. 

b+ca-ca-—b 
1. Find|b—c a+c b—a\. (Answer: 8abc) 

c-bc-aa+b 


l pu pul lmn 
2. If |m q v| = 8, then find the value of |r t n| + |p q rj. (Answer: 0) 
nrt qum uvt 
abe a+3b—3c 4b+ 6c c 
3. If|p q r| = 4, then find the value of | p + 3g —3r 4q + 6r r\J. 
mnk m + 3n — 3k 4n+ 6k k 
4. By using the properties of determinants, simplify the following, 
aa’ be 
bb? ca} (Answer: (a — b)(b — c)(c — a)abc) 
cc? ab 


5. By using the properties of determinants, simplify the following, 


laa 
1563) (Answer: (a+ b)(b+c)(c +a)\(at+b+c)) 
lee 


6. By using the properties of determinants, simplify the following, 


laa 
1b b?| (Answer: (a — b)(b —c)(c — a)) 
Lee 
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la, a2 + a3 
7. Show that |1 a2 a3 + a\| = 0. 
1 a3 a, +a2 


6.7 Application of Determinants (Line and Area) 


Overview Equations of lines and planes are often used in elementary mathematics. 
However, the use of matrices can result in various forms (such as two point forms 
for lines and areas of triangles) that highlight conceptual motivation for these topics. 
Let’s look at the line passing through two distinct points (x1, yi) and (x2, y2) and 
write ax + by +c = 0 as the equation of the line. Because the distinct points are 
on the line, (x1, y,) and (x2, y2) should satisfy this equation. Thus, 
ax+by+c=0 
ax; + by, +c=0 
ax, + by2 +c = 0. 
We now rewrite the above linear system of linear equations as a linear system in the 
unknown a, b,c, obtaining 
xa+yb+c=0 
xjat+yb+c=0 
xa + yob+c=0 


This is a homogeneous system with unknown a, b and c. It has a nontrivial solution 
if and only if 


x yl 
x,y 1 = 0, 
X2 yo 1 


1.e. 
(v1 — ya)x — (41 — X2)¥ + (X12 — Y1xX2) = O. 


Theorem 31 (Two-Point Form of the Equation of a Line). The equation of the 
line passing through two distinct points (x1, y1) and (X2, y2) is given by 


x yl 
x,y, 1] =0. 
X2 y2 1 
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Example 150. Find the equation of the line passing through the points (1, 4) and 
(—1,5). 


Solution. Applying the above theorem produces 
x yl 
1 41);=0. 
-151 


To evaluate this determinant, we expand by cofactors using the first row, 


41 eat eee 
= =—x-2y4+9= 
15 ‘ an 1 he : a an, 
ie.,x+2y—9=0. // 


Exercise 94. Find the equation of the line passing through the points (4,7) and 
(—6, 4). 


The following corollary is the consequence of the theorem which uses the notion 
of collinearity 


Definition. Three points or more are COLLINEAR in R” if all of the points are on 
the same line. 


Corollary 10 (Test for Collinear Points in the x, y-Plane). Three points (x1, y1), 
(x2, ¥2) and (x3, y3) are collinear if and only if 


xi yi 1 
X2 y2 1 = 0. 
x3 y3 1 


Example 151. Determine whether points (1, 2), (4, 5), (6, 7) are collinear. 
Solution. Since 

121 

451}=5+28+ 12—(30+7+ 8) =0, 

671 
they are collinear. // 
Exercise 95. Determine whether points (—1, —3), (—4, —7), (2, 2) are collinear. 


In the event that three points are not collinear, they can be viewed as the vertices 
of a triangle with a well defined area. In order to calculate this area directly without 
trigonometry, one would need to use the pythagorean theorem, the distance formula, 
and some geometric properties concerning triangles. On the other hand, the area can 
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be calculated using only the information given by the coordinates of the three points 
using determinants. 


Theorem 32. The area of the triangle APQR whose vertices are P(x,,y), 
O(X2, y2), R(x3, y3) is 


X{ yy 1 
eS as X2 yo lj, 


X3 y3 1 


where the sign + is chosen to give a positive area. 


Proof. From the diagram below, we see that 


Area of A POR = Area of trapezoid RR’ Q'O — Area of trapezoid RR’ P’ P 
— Area of trapezoid PP’Q’O 


_ 02 = %3)02 + ¥3) — Or = x3) + 3) — G2 = x)O1 + Y2) 
2 2: 2 


II 


1 
3 1y2 + X23 + X31 — X13 — X2V1 — X32) 


pial 
= 5 |*2 Ya 1). 
x3 ¥3 1 


Note if P is above the line QR, then the area is the negative of the determinant. Mi 


Example 152. Find the area of the triangle whose vertices are (1,1), (2,1) and 
(—3, 4). 
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Solution. We simply need the determinant 


111 
2 11)=-. 
—341 


// 


Exercise 96. Find the area of the triangle whose vertices are (—1, 0.5), (2, 1) and 
(—2,3). 


Problem Set 6.7: 


For Problems 1-4, determine whether or not the points are collinear and in the event 
that they are, find the equation of the line containing them. 


1. (4,17), (6, —3), and (2,5). 

2. (3,7), (12, 10), and (30, 16). 

3. (3,12), (1, 1), and (2,7). 

4. (0,3), (2,0), and (—4, 8). 
For Problems 5—7, determine a point P(x, y) such that P along with the given 
points Q and R form a triangle with the specified area. 

. O(3,8), R(7, 11) and the area is 12 square units. 

. O(0,0), R(8, 0) and the area is 32 square units. 

. O(5,7), R(—1, 7) and the area is 6 square units. 

. What happens if one tries to apply Theorem 31 when the three points P, QO, and 
R are collinear? 


aeanNnaAN 


(a) Apply Theorem 31’s area equation to the sets of collinear points from 
Problems 1-4. 
(b) Make a conjecture about the value of the expression: 


1 xi yi 1 
5 2 Y2 | 
x3 y3 1 


in the event (x1, 1), (X2, y2), and (x3, y3) are collinear. 
(c) Draw a diagram similar to the one in the proof of Theorem 31 to support 
your claim and write a paragraph justifying it. 


9. The following points, P, Q, and R are collinear. What is the value of a in terms 
of b if P = (3,5), O = (-1,4), and R = (a,b)? 


Selected Answers 


Section 1.1 


1. (a) 1,3,5,—9, 207 


(b) The set O is the set of all integers that are one less than an even number. 


(c) No, there is no contradiction between the two definitions. 
11l.a=2,b=2 
14. x =60rx = —2 
17. When n = 1, we see that p(1) = 1+ 1+ 17 = 19 whichis prime. 


Section 1.2 


Section 1.3 


1. (a) The quotient is 3 and the remainder is 12. 
(b) 63 — 17 = 46, 46 — 17 = 29, 29-17 = 12. 
4. (a) gq=-3,r=2 
(b) You never reach —7. 
(c) You lose the negative sign. 
8. 2 
11. 6 
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218 Solutions 
Section 1.4 


1. When c = 0, a must be 2, 5, or 8. When c = 5, a must be 0, 3, 6, or 9. 
9.a=Oandb=7 
14. 7” = (64+ 1)" = 6k + 1,s0 7” —1 = 1—1 = 0 on division by 6. 
16. 20502, 21312, 22122, 23832, 24642, 25452, 26262, 27072, 27972, 28782, 
29592. 


Section 1.5 


1. (a) (9A)12 
(b) (T))2 
(c) (101001110)> 
(d) (10001001), 
3. (a) (10001111), 
(b) (100011), 
(c) (1001). 
(d) (10001), 


Section 2.1 


1. 2 
4, 154 
11. 7 weeks 
13. (a) (120, 10), (60, 20), (40, 30) 
(b) 120, 60, 120 


Section 2.2 

1. 23 

8. (11,121), (55,77) 
Section 2.3 


1. bandc 
3. x = 162, y = 36 is one solution so x = 162 + 22t, y = 36 + 5t for integer ¢ 
yields all solutions. 
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Section 2.4 


1. (a) 45 
(b) 34 
(c) 28 
4. (a) False. n = 2 
(b) False.n = 2 andm = 1 
(c) True 
12.2% 


Section 2.5 


9. (14) 


Section 3.1 


1. (a) {Of} 
(b) {1,-I} 
5. Even and odd 


Section 3.2 


1. (a) 5,5 
(b) 10 
(c) 3 
1Div. 1 = 1,24+242=6,343=64444444=16 


Section 3.3 
Section 3.4 


Section 4.1 


1. (x — 1x + 1)(5x — 2) 
10.a=5,b=-1 
11. -1 
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Section 4.2 


3. (a) q(x) = x-1,r(x) =3 
(b) q(x) = 4x+4+ 1,r(x) =0 
5. r(x) =x? +x4+1 


Section 4.3 


lorry) = es 
5.a=3,b=1. 
9. r(x) = 34-1873 
20. a =0,b=2 


Section 4.4 


1. (a) g(x) = 3x? - 4x -—Z, r(x) = Bx- 2 


(b) q(x) = x3 4-207 + 5x -Fr@=-pr-47 


Zq 
5. (a) q(x) = —x* — 4x +5, r(x) = —24x 4+ 12 


(b) g(x) = 3x7 + x, r(x) =0 


Section 5.1 


1. No 

2. Yes 

7. t = —35 

11. k =—-4,t = —4 


Section 5.2 


ll 

| 
rad 
vin 
Nn 


Solutions 


Selected Answers 
Section 5.3 
1. (a) 4—2x + x7, x© + 5x5 — 6x4 + 8x3 + 40x? — 48x 
(b) —6 +x + x7, x7 —3x° + 6x4 — 49x73 + 24x? + 147x — 126 
Section 6.1 


1. (a) Not possible. 
(b) Not possible. 
(c) [82 12] 


(c) None exists. 


1 1 
—1 zi z ir 
Qe =|=5 9. 3 
eee ae 
2 2, 2 
—62 3 _27 
(ec) Et = ad = 1s 
5 5 5 
2 0 1 
Section 6.2 
2. 2=07=S—, 
1 14 
3. %=30=5 
9 x=1,y=0 
12. x = }In¥, y = log, 4 
Section 6.3 
14 3 
1. (a) A= 5914 


—23 1 


221 


222 


2-35 
(b) A= 1 43 
—2 31 
(c) It depends on the order of operations. 
2x= z, y= = 
Section 6.4 
127 18 49 
lx = Y= 12 = 79 


2. No solutions. 
3. x=-l+ity=14+3t,z2=1 


Section 6.5 


1. (a) GCD(260, 90) = 10, 10 = (-1) - 260 + (3) - (90) 
(b) GCD(1785, 340) = 85, 85 = (1) - 1785 + (—5) - 340 
(c) GCD(—616, 286) = 22, 22 = (6) - (—616) + (13) - 286 


Section 6.6 


Section 6.7 


1. No 
2. Yes. y = ix —2 
6. P(0, 8) 


Solutions 
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